NVIDIA cuTENSOR
Project description
cuTENSOR is a high-performance CUDA library for tensor primitives.
Key Features
Extensive mixed-precision support:
FP64 inputs with FP32 compute.
FP32 inputs with FP16, BF16, or TF32 compute.
Complex-times-real operations.
Conjugate (without transpose) support.
Support for up to 64-dimensional tensors.
Arbitrary data layouts.
Trivially serializable data structures.
Main computational routines:
Direct (i.e., transpose-free) tensor contractions.
Tensor reductions (including partial reductions).
Element-wise tensor operations:
Support for various activation functions.
Arbitrary tensor permutations.
Conversion between different data types.
Documentation
Please refer to https://docs.nvidia.com/cuda/cutensor/index.html for the cuTENSOR documentation.
Installation
The cuTENSOR wheel can be installed as follows:
pip install cutensor-cuXX
where XX is the CUDA major version (currently CUDA 11 & 12 are supported). The package cutensor (without the -cuXX suffix) is deprecated. If you have cutensor installed, please remove it prior to installing cutensor-cuXX.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distributions
Hashes for cutensor_cu11-1.7.0-py3-none-win_amd64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | d173b3d0fd51cf761b371a4d4be9a3afd3ef230a55ae4336ae31e905336480e1 |
|
MD5 | 5ee53eea07a7373048a4e1295587e60b |
|
BLAKE2b-256 | 571ba84ff9b9743bb188921264d971cc930d1a56bdc2ef4d8bd6016be19ad91e |
Hashes for cutensor_cu11-1.7.0-py3-none-manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 67b6c7427d9ab50cb82e01360948bd1b23d73775b5767ab92071c7afcfec4b8b |
|
MD5 | c80ad666378ca94c227da23a9dcd1b75 |
|
BLAKE2b-256 | 5d70b6fed26644820d8b10e19f2bdd4fb45e20b352422eebfff91130e5ee8a6a |
Hashes for cutensor_cu11-1.7.0-py3-none-manylinux2014_aarch64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | c5598670f4f31906d725f5ea852f0df675522e3ff5a7bf886057eab36497062d |
|
MD5 | 6cd9075db9a2477c9c1e06073d7b8d74 |
|
BLAKE2b-256 | 281803c7c1dc9f6e2b49c589b2570830c205cc3c0e0110b8f39e556a646e1bdb |