cuGraph - RAPIDS GPU Graph Analytics

These details have not been verified by PyPI

Project links

Project description

RAPIDS cuGraph is a monorepo that represents a collection of packages focused on GPU-accelerated graph analytics, including support for property graphs, remote (graph as a service) operations, and graph neural networks (GNNs). cuGraph supports the creation and manipulation of graphs followed by the execution of scalable fast graph algorithms.

Getting cuGraph * Graph Algorithms * Graph Service * Property Graph * GNN Support

News

NEW! nx-cugraph, a NetworkX backend that provides GPU acceleration to NetworkX with zero code change.

> pip install nx-cugraph-cu11 --extra-index-url https://pypi.nvidia.com
> export NETWORKX_AUTOMATIC_BACKENDS=cugraph

That's it. NetworkX now leverages cuGraph for accelerated graph algorithms.

Installation
General
Packages
API Docs
- Python
  - Python Nightly
  - Python Stable
- C++
  - C++ Nightly
  - C++ Stable
References
- RAPIDS
- ARROW
- DASK

RAPIDS cuGraph is a collection of GPU-accelerated graph algorithms and services. At the Python layer, cuGraph operates on GPU DataFrames, thereby allowing for seamless passing of data between ETL tasks in cuDF and machine learning tasks in cuML. Data scientists familiar with Python will quickly pick up how cuGraph integrates with the Pandas-like API of cuDF. Likewise, users familiar with NetworkX will quickly recognize the NetworkX-like API provided in cuGraph, with the goal to allow existing code to be ported with minimal effort into RAPIDS. To simplify integration, cuGraph also supports data found in Pandas DataFrame, NetworkX Graph Objects and several other formats.

While the high-level cugraph python API provides an easy-to-use and familiar interface for data scientists that's consistent with other RAPIDS libraries in their workflow, some use cases require access to lower-level graph theory concepts. For these users, we provide an additional Python API called pylibcugraph, intended for applications that require a tighter integration with cuGraph at the Python layer with fewer dependencies. Users familiar with C/C++/CUDA and graph structures can access libcugraph and libcugraph_c for low level integration outside of python.

NOTE: For the latest stable README.md ensure you are on the latest branch.

As an example, the following Python snippet loads graph data and computes PageRank:

import cudf
import cugraph

# read data into a cuDF DataFrame using read_csv
gdf = cudf.read_csv("graph_data.csv", names=["src", "dst"], dtype=["int32", "int32"])

# We now have data as edge pairs
# create a Graph using the source (src) and destination (dst) vertex pairs
G = cugraph.Graph()
G.from_cudf_edgelist(gdf, source='src', destination='dst')

# Let's now get the PageRank score of each vertex by calling cugraph.pagerank
df_page = cugraph.pagerank(G)

# Let's look at the top 10 PageRank Score
df_page.sort_values('pagerank', ascending=False).head(10)

Why cuGraph does not support Method Cascading

Projects that use cuGraph

(alphabetical order)

ArangoDB - a free and open-source native multi-model database system - https://www.arangodb.com/
CuPy - "NumPy/SciPy-compatible Array Library for GPU-accelerated Computing with Python" - https://cupy.dev/
Memgraph - In-memory Graph database - https://memgraph.com/
NetworkX (via nx-cugraph backend) - an extremely popular, free and open-source package for the creation, manipulation, and study of the structure, dynamics, and functions of complex networks - https://networkx.org/
PyGraphistry - free and open-source GPU graph ETL, AI, and visualization, including native RAPIDS & cuGraph support - http://github.com/graphistry/pygraphistry
ScanPy - a scalable toolkit for analyzing single-cell gene expression data - https://scanpy.readthedocs.io/en/stable/

(please post an issue if you have a project to add to this list)

Open GPU Data Science

The RAPIDS suite of open source software libraries aims to enable execution of end-to-end data science and analytics pipelines entirely on GPUs. It relies on NVIDIA® CUDA® primitives for low-level compute optimization but exposing that GPU parallelism and high-bandwidth memory speed through user-friendly Python interfaces.

For more project details, see rapids.ai.

Apache Arrow on GPU

The GPU version of Apache Arrow is a common API that enables efficient interchange of tabular data between processes running on the GPU. End-to-end computation on the GPU avoids unnecessary copying and converting of data off the GPU, reducing compute time and cost for high-performance analytics common in artificial intelligence workloads. As the name implies, cuDF uses the Apache Arrow columnar data format on the GPU. Currently, a subset of the features in Apache Arrow are supported.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

24.10.0

Oct 10, 2024

24.8.0

Aug 8, 2024

24.6.1

Jun 14, 2024

24.6.0

Jun 6, 2024

24.4.0

Apr 12, 2024

24.2.0

Feb 13, 2024

23.12.0

Dec 7, 2023

23.10.0

Oct 11, 2023

23.8.0

Aug 10, 2023

23.6.2

Jun 14, 2023

23.6.1

Jun 9, 2023

23.6.0

Jun 9, 2023

23.4.1

Apr 21, 2023

23.4.0.1681369743

Apr 13, 2023

23.4.0

Apr 20, 2023

23.2.0

Feb 10, 2023

0.0.1 yanked

Sep 14, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cugraph_cu11-24.10.0.tar.gz (4.0 kB view details)

Uploaded Oct 10, 2024 Source

File details

Details for the file cugraph_cu11-24.10.0.tar.gz.

File metadata

Download URL: cugraph_cu11-24.10.0.tar.gz
Upload date: Oct 10, 2024
Size: 4.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.0.0 CPython/3.10.12

File hashes

Hashes for cugraph_cu11-24.10.0.tar.gz
Algorithm	Hash digest
SHA256	`6ad891d1bb65807dcb2cf48a389f3e0feb6d23eb6ed0d8cce4303f1e4225de19`
MD5	`4c03e7bd5d21b186984456c64b47116b`
BLAKE2b-256	`9900097e3a0bdc5f725c2197caef9a66e647e4b1673209297bb594f3484626a8`