Lightweight static analysis for many languages. Find bug variants with patterns that look like source code.
Project description
Code scanning at ludicrous speed.
This repository contains the source code for Semgrep OSS (open-source software). Semgrep OSS is a fast, open-source, static analysis tool for searching code, finding bugs, and enforcing code standards at editor, commit, and CI time. Semgrep is a semantic grep for code: where grep "2"
would only match the exact string 2, Semgrep would match x = 1; y = x + 1
when searching for 2. And it does this in 30+ languages! Semgrep rules look like the code you already write; no abstract syntax trees, regex wrestling, or painful DSLs: read more below.
For companies who need SAST, SCA, and Secret scanning, we provide a product suite on top of Semgrep OSS that scans code and package dependencies for known issues, software vulnerabilities, and finds secrets with high accuracy:
- Semgrep Code to find bugs & vulnerabilities using the deeper, interfile-analysis enabled Pro engine and high-accuracy Pro rules in addition to the community rules
- Semgrep Supply Chain to find dependencies with known vulnerabilities function-level reachability analysis
- Semgrep Secrets to find hard-coded credentials that shouldn't be checked into source code
Semgrep analyzes code locally on your computer or in your build environment: by default, code is never uploaded. Get started →.
Language support
Semgrep Code supports 30+ languages, including:
Apex · Bash · C · C++ · C# · Clojure · Dart · Dockerfile · Elixir · HTML · Go · Java · JavaScript · JSX · JSON · Julia · Jsonnet · Kotlin · Lisp · Lua · OCaml · PHP · Python · R · Ruby · Rust · Scala · Scheme · Solidity · Swift · Terraform · TypeScript · TSX · YAML · XML · Generic (ERB, Jinja, etc.)
Semgrep Supply Chain supports 12 languages across 15 package managers, including:
C# (NuGet) · Dart (Pub) · Go (Go modules, go mod
) · Java (Gradle, Maven) · Javascript/Typescript (npm, Yarn, Yarn 2, Yarn 3, pnpm) · Kotlin (Gradle, Maven) · PHP (Composer) · Python (pip, pip-tool, Pipenv, Poetry) · Ruby (RubyGems) · Rust (Cargo) · Scala (Maven) · Swift (SwiftPM)
For more information, see Supported languages.
Getting started 🚀
For new users, we recommend starting with the Semgrep AppSec Platform because it provides a visual interface, a demo project, result triaging and exploration workflows, and makes setup in CI/CD fast. Scans are still local and code isn't uploaded. Alternatively, you can also start with the CLI and navigate the terminal output to run one-off searches.
Option 1: Getting started from the Semgrep Appsec Platform (Recommended)
-
Register on semgrep.dev
-
Explore the demo findings to learn how Semgrep works
-
Scan your project by navigating to
Projects > Scan New Project > Run scan in CI
-
Select your version control system and follow the onboarding steps to add your project. After this setup, Semgrep will scan your project after every pull request.
-
[Optional] If you want to run Semgrep locally, follow the steps in the CLI section.
Notes:
If there are any issues, please ask for help in the Semgrep Slack.
Option 2: Getting started from the CLI
- Install Semgrep CLI
# For macOS
$ brew install semgrep
# For Ubuntu/WSL/Linux/macOS
$ python3 -m pip install semgrep
# To try Semgrep without installation run via Docker
$ docker run -it -v "${PWD}:/src" semgrep/semgrep semgrep login
$ docker run -e SEMGREP_APP_TOKEN=<TOKEN> --rm -v "${PWD}:/src" semgrep/semgrep semgrep ci
- Run
semgrep login
to create your account and login to Semgrep.
Logging into Semgrep gets you access to:
- Semgrep Supply Chain: A dependency scanner that detects reachable vulnerabilities in third party libraries
- Semgrep Code's Pro rules: 600+ high confidence rules written by Semgrep's security research team
- Semgrep Code's Pro engine: An advanced code analysis engine, designed to detect complex vulnerabilities, and reduce false positives
-
Go to your app's root directory and run
semgrep ci
. This will scan your project to check for vulnerabilities in your source code and its dependencies. -
Try writing your own query interactively with
-e
. For example, a check for Python == where the left and right hand sides are the same (potentially a bug):$ semgrep -e '$X == $X' --lang=py path/to/src
Semgrep Ecosystem
The Semgrep ecosystem includes the following products:
- Semgrep Code - Scan your code with Semgrep's proprietary rules (written by our Security Research team) using our cross-file and cross-function analysis. Designed to find OWASP Top 10 vulnerabilities and protect against critical security risks. Semgrep Code is available on both free and paid tiers.
- Semgrep Supply Chain (SSC) - A high-signal dependency scanner that detects reachable vulnerabilities in open source third-party libraries and functions across the software development life cycle (SDLC). Semgrep Supply Chain is available on both free and paid tiers.
- Semgrep Secrets [NEW!] - Secrets detection that uses semantic analysis, improved entropy analysis, and validation together to accurately detect sensitive credentials in developer workflows. Book a demo to request early access to the product.
- Semgrep AppSec Platform - Deploy, manage, and monitor Semgrep at scale, with free and paid tiers. Integrates with continuous integration (CI) providers such as GitHub, GitLab, CircleCI, and more.
- Semgrep OSS Engine - The open-source engine and community-contributed rules at the heart of everything (this project).
To learn more about Semgrep, visit:
- Semgrep Playground - An online interactive tool for writing and sharing rules.
- Semgrep Registry - 2,000+ community-driven rules covering security, correctness, and dependency vulnerabilities.
Join hundreds of thousands of other developers and security engineers already using Semgrep at companies like GitLab, Dropbox, Slack, Figma, Shopify, HashiCorp, Snowflake, and Trail of Bits.
Semgrep is developed and commercially supported by Semgrep, Inc., a software security company.
Semgrep Rules
Semgrep rules look like the code you already write; no abstract syntax trees, regex wrestling, or painful DSLs. Here's a quick rule for finding Python print()
statements.
Run it online in Semgrep’s Playground by clicking here.
Examples
Visit Docs > Rule examples for use cases and ideas.
Use case | Semgrep rule |
---|---|
Ban dangerous APIs | Prevent use of exec |
Search routes and authentication | Extract Spring routes |
Enforce the use secure defaults | Securely set Flask cookies |
Tainted data flowing into sinks | ExpressJS dataflow into sandbox.run |
Enforce project best-practices | Use assertEqual for == checks, Always check subprocess calls |
Codify project-specific knowledge | Verify transactions before making them |
Audit security hotspots | Finding XSS in Apache Airflow, Hardcoded credentials |
Audit configuration files | Find S3 ARN uses |
Migrate from deprecated APIs | DES is deprecated, Deprecated Flask APIs, Deprecated Bokeh APIs |
Apply automatic fixes | Use listenAndServeTLS |
Extensions
Visit Docs > Extensions to learn about using Semgrep in your editor or pre-commit. When integrated into CI and configured to scan pull requests, Semgrep will only report issues introduced by that pull request; this lets you start using Semgrep without fixing or ignoring pre-existing issues!
Documentation
Browse the full Semgrep documentation on the website. If you’re new to Semgrep, check out Docs > Getting started or the interactive tutorial.
Metrics
Using remote configuration from the Registry (like --config=p/ci
) reports pseudonymous rule metrics to semgrep.dev.
Using configs from local files (like --config=xyz.yml
) does not enable metrics.
To disable Registry rule metrics, use --metrics=off
.
The Semgrep privacy policy describes the principles that guide data-collection decisions and the breakdown of the data that are and are not collected when the metrics are enabled.
More
- Frequently asked questions (FAQs)
- Contributing
- Build instructions for developers
- Ask questions in the Semgrep community Slack
- CLI reference and exit codes
- Semgrep YouTube channel
- License (LGPL-2.1)
- Licensing Semgrep
Upgrading
To upgrade, run the command below associated with how you installed Semgrep:
# Using Homebrew
$ brew upgrade semgrep
# Using pip
$ python3 -m pip install --upgrade semgrep
# Using Docker
$ docker pull semgrep/semgrep:latest
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distributions
File details
Details for the file semgrep-1.88.0.tar.gz
.
File metadata
- Download URL: semgrep-1.88.0.tar.gz
- Upload date:
- Size: 27.3 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6ed08e4deeae5734e7e603aa4196c6c287bb2c1acd16e0a80dce6c1e52850d43 |
|
MD5 | c693d2729e37f2d5e6c9453cf4bac572 |
|
BLAKE2b-256 | 43ab092dacaa38d9238cf84f597efeefacc0df435ca1a0a10cf68ecd7de2c561 |
File details
Details for the file semgrep-1.88.0-cp38.cp39.cp310.cp311.py37.py38.py39.py310.py311-none-musllinux_1_0_aarch64.manylinux2014_aarch64.whl
.
File metadata
- Download URL: semgrep-1.88.0-cp38.cp39.cp310.cp311.py37.py38.py39.py310.py311-none-musllinux_1_0_aarch64.manylinux2014_aarch64.whl
- Upload date:
- Size: 32.3 MB
- Tags: CPython 3.10, CPython 3.11, CPython 3.8, CPython 3.9, Python 3.10, Python 3.11, Python 3.7, Python 3.8, Python 3.9, musllinux: musl 1.0+ ARM64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | e7ec004e860b39c2fe7214d04dff696988616dce5e9c48e87651821fb75c9d74 |
|
MD5 | 1913a8d053f616623c96a788c5dcca2a |
|
BLAKE2b-256 | 8cb288dbf47eac5f917d190d117f10a6b096bac849309f63d764fd3de44c3aa3 |
File details
Details for the file semgrep-1.88.0-cp38.cp39.cp310.cp311.py37.py38.py39.py310.py311-none-macosx_11_0_arm64.whl
.
File metadata
- Download URL: semgrep-1.88.0-cp38.cp39.cp310.cp311.py37.py38.py39.py310.py311-none-macosx_11_0_arm64.whl
- Upload date:
- Size: 33.4 MB
- Tags: CPython 3.10, CPython 3.11, CPython 3.8, CPython 3.9, Python 3.10, Python 3.11, Python 3.7, Python 3.8, Python 3.9, macOS 11.0+ ARM64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | d3521367a28b36eaeafa9347f3c2ce88a80d25a716e380aeeca8257bd84c5d9a |
|
MD5 | a4f674cea70b7a7833a55c705758fb4e |
|
BLAKE2b-256 | 8097e16c9ebcb23057b849de53903bc254507977055806beeaba08a24e677ee6 |
File details
Details for the file semgrep-1.88.0-cp38.cp39.cp310.cp311.py37.py38.py39.py310.py311-none-macosx_10_14_x86_64.whl
.
File metadata
- Download URL: semgrep-1.88.0-cp38.cp39.cp310.cp311.py37.py38.py39.py310.py311-none-macosx_10_14_x86_64.whl
- Upload date:
- Size: 27.7 MB
- Tags: CPython 3.10, CPython 3.11, CPython 3.8, CPython 3.9, Python 3.10, Python 3.11, Python 3.7, Python 3.8, Python 3.9, macOS 10.14+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3d9e653a0e1fd65cff8208be7d4ea607cbf4ae317f84d8cee83b9fa863378ce8 |
|
MD5 | 4372062680a174da8e9ef111b694d9ae |
|
BLAKE2b-256 | 7161d6d563174775df23ab7f6e7a79e2444c1b17108715515595caa6ec9529d2 |
File details
Details for the file semgrep-1.88.0-cp38.cp39.cp310.cp311.py37.py38.py39.py310.py311-none-any.whl
.
File metadata
- Download URL: semgrep-1.88.0-cp38.cp39.cp310.cp311.py37.py38.py39.py310.py311-none-any.whl
- Upload date:
- Size: 27.7 MB
- Tags: CPython 3.10, CPython 3.11, CPython 3.8, CPython 3.9, Python 3.10, Python 3.11, Python 3.7, Python 3.8, Python 3.9
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6d207d31bd2a0dd59b81a54d62b7834f39a4ba4161f8d0fc930ba2ccd4458303 |
|
MD5 | b1517bf48419116799cd73870e0101b5 |
|
BLAKE2b-256 | 2a45c816bd9ab35def8e6a011f421ec88ac11c33beefbafb7f2a1036abf13d6f |