ScanCode is a tool to scan code for licenses, copyrights, packages and their documented dependencies, and other interesting facts.

Project description

A typical software project often reuses hundreds of third-party packages. License, package, dependency, and origin information is not always easy to find and is not normalized: ScanCode discovers and normalizes this data for you.

Read more about ScanCode here: https://scancode-toolkit.readthedocs.io/.

Check out the code at https://github.com/nexB/scancode-toolkit

Discover also:

Build and tests status

We run 30,000+ tests on each commit on multiple CIs to ensure good platform compatibility with multiple versions of Windows, Linux, and macOS.

  • Azure: Azure tests status (Linux, macOS, Windows)

  • RTD Build: Documentation Status

  • GitHub actions Docs: Documentation Tests

  • GitHub actions Release: Release tests

Why use ScanCode?

  • As a standalone command-line tool, ScanCode is easy to install, run, and embed in your CI/CD processing pipeline. It runs on Windows, macOS, and Linux.

  • ScanCode is used by several projects and organizations such as the Eclipse Foundation, OpenEmbedded.org, the FSFE, the FSF, OSS Review Toolkit, ClearlyDefined.io, RedHat Fabric8 analytics, and many more.

  • ScanCode detects licenses, copyrights, package manifests, direct dependencies, and more in both source code and binary files. It is considered the best-in-class reference tool in this domain and is reused as the core tool for software composition data collection by several open source tools.

  • ScanCode provides the most accurate license detection engine and does a full comparison (also known as diff or red line comparison) between a database of license texts and your code, instead of relying only on approximate regex patterns, probabilistic search, edit distance, or machine learning.

  • Written in Python, ScanCode is easy to extend with plugins to contribute new and improved scanners, data summarization, package manifest parsers, and new outputs.

  • You can save your scan results as JSON, YAML, HTML, CycloneDX or SPDX or even create your own format with Jinja templates.

  • You can also run ScanCode server-side with the companion ScanCode.io web app to organize and store multiple scan projects, including scripted scanning pipelines.

  • ScanCode output data can be easily visualized and analysed using the ScanCode Workbench desktop app.

  • ScanCode is actively maintained and has a growing community of users and contributors.

  • ScanCode is heavily tested with an automated test suite of over 20,000 tests.

  • ScanCode has extensive and growing documentation.

  • ScanCode can process package, build manifest, and lockfile formats to collect Package URLs and extract metadata: Alpine packages, BUCK files, ABOUT files, Android apps, Autotools, Bazel, JavaScript Bower, Java Axis, MS Cab, Rust Cargo, Cocoapods, Chef, Chrome apps, PHP Composer and composer.lock, Conda, CPAN, Debian, Apple dmg, Java EAR, WAR, JAR, FreeBSD packages, Rubygems gemspec, Gemfile and Gemfile.lock, Go modules, Haxe packages, InstallShield installers, iOS apps, ISO images, Apache IVY, JBoss Sar, R CRAN, Apache Maven, Meteor, Mozilla extensions, MSI installers, JavaScript npm packages, package-lock.json, yarn.lock, NSIS Installers, NuGet, opam, Python PyPI setup.py, setup.cfg, and several related lockfile formats, semi-structured README files such as README.android, README.chromium, README.facebook, README.google, README.thirdparty, RPMs, Shell Archives, Squashfs images, Windows executables and the Windows registry, and a few more. See all available package parsers for the exhaustive list.
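To make the Package URL mentioned in the last item concrete, here is a minimal Python sketch of how such an identifier is assembled. The `make_purl` helper is hypothetical and deliberately simplified: it ignores qualifiers, subpaths, and the type-specific normalization rules of the Package URL specification, and it is not how ScanCode itself builds these values.

```python
from urllib.parse import quote


def make_purl(type_, name, version=None, namespace=None):
    """Build a simplified Package URL: pkg:type/namespace/name@version.

    Hypothetical helper for illustration only; qualifiers, subpaths and
    per-type normalization are intentionally left out.
    """
    parts = ["pkg:", type_.lower(), "/"]
    if namespace:
        # Percent-encode each namespace segment separately.
        encoded = "/".join(quote(seg, safe="") for seg in namespace.split("/"))
        parts.append(encoded + "/")
    parts.append(quote(name, safe=""))
    if version:
        parts.append("@" + quote(version, safe=""))
    return "".join(parts)


print(make_purl("pypi", "scancode-toolkit", "32.0.3"))
print(make_purl("maven", "commons-lang3", "3.12.0", namespace="org.apache.commons"))
```

For real work, a dedicated library such as packageurl-python is likely a better choice than a hand-rolled helper like this one.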

See our roadmap for upcoming features.

Documentation

The ScanCode documentation is hosted at scancode-toolkit.readthedocs.io.

If you are new to visualizing ScanCode results data, start with our newcomer page.

If you want to compare output changes between different versions of ScanCode, or want to look at scans generated by ScanCode, review our reference scans.

Other Important Documentation Pages:

See also https://aboutcode.org for related companion projects and tools.

Installation

Before installing ScanCode, make sure that you have installed the prerequisites properly. This means installing a supported Python version for x86/64 architectures. We support Python 3.8, 3.9, 3.10 and 3.11.

See prerequisites for detailed information on the supported platforms and Python versions.

There are a few common ways to install ScanCode.

Quick Start

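After ScanCode is installed successfully, you can run an example scan that prints its results on screen as JSON. The -clip option is shorthand for -c -l -i -p (copyright, license, info, and package scans):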

scancode -clip --json-pp - samples

Follow the How to Run a Scan tutorial to perform a basic scan on the samples directory distributed by default with ScanCode.
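Once a scan has produced JSON, the results can be post-processed with a few lines of Python. The sketch below counts license expressions per file. It assumes the scan has a top-level `files` list whose entries carry `type` and `detected_license_expression` keys; these field names are an assumption about the output layout and may differ between ScanCode versions, so check them against your own scan. The `sample_scan` data is made up for illustration.

```python
from collections import Counter


def count_license_expressions(scan):
    """Count how many scanned files report each license expression.

    Assumes (not guaranteed) a "files" list with "type" and
    "detected_license_expression" keys on each entry.
    """
    counts = Counter()
    for resource in scan.get("files", []):
        if resource.get("type") != "file":
            continue  # skip directories
        expression = resource.get("detected_license_expression")
        if expression:
            counts[expression] += 1
    return counts


# Made-up data standing in for a parsed scan result.
sample_scan = {
    "files": [
        {"path": "samples", "type": "directory"},
        {"path": "samples/a.c", "type": "file", "detected_license_expression": "mit"},
        {"path": "samples/b.c", "type": "file", "detected_license_expression": "mit"},
        {"path": "samples/README", "type": "file", "detected_license_expression": None},
    ]
}

print(count_license_expressions(sample_scan).most_common())
```

To try this on a real scan, write the results to a file instead of the screen (for example by passing a file name such as scan.json in place of the `-` argument), then load that file with `json.load` and pass the resulting dictionary to the function.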

See more command examples:

scancode --examples

See How to select what will be detected in a scan and How to specify the output format for more information.

You can also refer to the command line options synopsis and an exhaustive list of all available command line options.

Archive extraction

By default, ScanCode does not extract files from tarballs, zip files, and other archives as part of the scan. The archives that exist in a codebase must be extracted before running a scan: extractcode is a bundled utility that behaves as a mostly-universal archive extractor. For example, this command will recursively extract the mytar.tar.bz2 tarball into the mytar.tar.bz2-extract directory:

./extractcode mytar.tar.bz2

See all extractcode options and how to extract archives for details.
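The naming convention described above (extracting next to the archive, into a directory named after it with an `-extract` suffix) can be illustrated with the Python standard library. This is only a sketch of the convention, not of extractcode itself: the hypothetical `extract_next_to_archive` helper handles a single archive in a format that `shutil` knows about and does not recurse into nested archives.

```python
import shutil
from pathlib import Path


def extract_next_to_archive(archive_path):
    """Extract an archive into a sibling "<archive name>-extract" directory.

    Hypothetical helper: one level only, standard-library formats only.
    Do not use it as-is on untrusted archives.
    """
    archive = Path(archive_path)
    target = archive.with_name(archive.name + "-extract")
    shutil.unpack_archive(str(archive), str(target))
    return target
```

Calling `extract_next_to_archive("mytar.tar.bz2")` would create `mytar.tar.bz2-extract` in the same directory as the tarball.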

Support

If you have a problem or a suggestion, or have found a bug, please open a ticket at: https://github.com/nexB/scancode-toolkit/issues

For discussions and chats, we have:

  • An official Gitter channel for web-based chats. Gitter is now accessible through Element or an IRC bridge. There are other AboutCode project-specific channels available there too.

  • The discussion channel for ScanCode, aimed specifically at users and developers using scancode-toolkit.

Source code and downloads

License

  • Apache-2.0 as the overall license

  • CC-BY-4.0 for reference datasets (initially in the Public Domain).

  • Multiple other secondary permissive or copyleft licenses (LGPL, MIT, BSD, GPL 2/3, etc.) for third-party components and test suite code and data.

See the NOTICE file and the .ABOUT files that document the origin and license of the third-party code used in ScanCode for more details.

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

  • scancode-toolkit-32.0.3.tar.gz (14.4 MB), uploaded as Source

Built Distributions

  • scancode_toolkit-32.0.3-cp311-none-any.whl (104.8 MB), uploaded for CPython 3.11

  • scancode_toolkit-32.0.3-cp310-none-any.whl (104.8 MB), uploaded for CPython 3.10

  • scancode_toolkit-32.0.3-cp39-none-any.whl (104.8 MB), uploaded for CPython 3.9

  • scancode_toolkit-32.0.3-cp38-none-any.whl (104.8 MB), uploaded for CPython 3.8

  • scancode_toolkit-32.0.3-cp37-none-any.whl (104.8 MB), uploaded for CPython 3.7

File details

Details for the file scancode-toolkit-32.0.3.tar.gz.

File metadata

  • Download URL: scancode-toolkit-32.0.3.tar.gz
  • Upload date:
  • Size: 14.4 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.16

File hashes

Hashes for scancode-toolkit-32.0.3.tar.gz
Algorithm Hash digest
SHA256 b2b5d98f9597cbfa6843b115be8c11927ff999a6dfbbd92dd7ec31b997b20767
MD5 ee6807fe4960676375e842fd0fea473e
BLAKE2b-256 88f1e634cae864d731f7e65f18ab33a45ea535847d7d5002477db7b25e123e9e

See more details on using hashes here.
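A downloaded file can be checked against the published digest with Python's standard `hashlib` module. The sketch below uses the SHA256 value listed above for the source distribution; the `sha256_of` and `matches_expected` helper names are made up for this example, and the download itself is assumed to have happened already.

```python
import hashlib

# SHA256 digest published above for scancode-toolkit-32.0.3.tar.gz.
EXPECTED_SHA256 = "b2b5d98f9597cbfa6843b115be8c11927ff999a6dfbbd92dd7ec31b997b20767"


def sha256_of(path, chunk_size=1024 * 1024):
    """Return the hex SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected_hex):
    """Compare a file's SHA256 digest to an expected hex string."""
    return sha256_of(path) == expected_hex.lower()
```

For example, `matches_expected("scancode-toolkit-32.0.3.tar.gz", EXPECTED_SHA256)` should return True for an intact download of that file.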

Provenance

File details

Details for the file scancode_toolkit-32.0.3-cp311-none-any.whl.

File hashes

Hashes for scancode_toolkit-32.0.3-cp311-none-any.whl
Algorithm Hash digest
SHA256 573f8dc724a74d6aec42948f247c780ddebd446a00dc53cdbb316ec65067cc9f
MD5 33f4c579bc9261b747dec04c7498a65e
BLAKE2b-256 ed18e1b5ca9637c4828a9b3f9180324af49ad5e3e633f46fb9e0e278df5d6804

File details

Details for the file scancode_toolkit-32.0.3-cp310-none-any.whl.

File hashes

Hashes for scancode_toolkit-32.0.3-cp310-none-any.whl
Algorithm Hash digest
SHA256 89028bff6fd0b5495a48671fc5e82f4a4a71841170c32966c49f8a4176c02b67
MD5 083db76d1c988e0019079a96e1ceafbc
BLAKE2b-256 d63e3feaa72bec23a910c3b787eed5d32489129f722328653d61e2fee6df5f11

File details

Details for the file scancode_toolkit-32.0.3-cp39-none-any.whl.

File hashes

Hashes for scancode_toolkit-32.0.3-cp39-none-any.whl
Algorithm Hash digest
SHA256 628c138480b358ae6e2cb537fa99b7eb13bc3bca4452ea0ef3c01bff4040e30a
MD5 a518d9715a8aac6a8633a4427254c71b
BLAKE2b-256 1d3f9ac6062905282efbc7ec2fac878a54294fd246b33c67d781a85cd75d8d22

File details

Details for the file scancode_toolkit-32.0.3-cp38-none-any.whl.

File hashes

Hashes for scancode_toolkit-32.0.3-cp38-none-any.whl
Algorithm Hash digest
SHA256 f6c01bf6b66bd6ebda93a03260df0015703aeed8ce2b771a472e71210aeacab0
MD5 5faf47b239f127a135e59f71e5b271ad
BLAKE2b-256 59bfd8d817a95bb0af9b3f82f60df54784b42eaebc46fb271ee316be6ddb5793

File details

Details for the file scancode_toolkit-32.0.3-cp37-none-any.whl.

File hashes

Hashes for scancode_toolkit-32.0.3-cp37-none-any.whl
Algorithm Hash digest
SHA256 b0ae7a6f0853cf32975e03fd5090e51384313785527b3a178237e89d2cbac746
MD5 e7ad2f9e691a43328f2a9db70d69f442
BLAKE2b-256 dfb51e2c09fec0215a61ae3018d3674cd212f3e2ed15554ea17a9c4b467af67c
