NVIDIA cuTENSOR
Project description
cuTENSOR is a high-performance CUDA library for tensor primitives.
Key Features
Extensive mixed-precision support:
FP64 inputs with FP32 compute.
FP32 inputs with FP16, BF16, or TF32 compute.
Complex-times-real operations.
Conjugate (without transpose) support.
Support for up to 64-dimensional tensors.
Arbitrary data layouts.
Trivially serializable data structures.
Main computational routines:
Direct (i.e., transpose-free) tensor contractions.
Tensor reductions (including partial reductions).
Element-wise tensor operations:
Support for various activation functions.
Arbitrary tensor permutations.
Conversion between different data types.
Documentation
Please refer to https://docs.nvidia.com/cuda/cutensor/index.html for the cuTENSOR documentation.
Installation
The cuTENSOR wheel can be installed as follows:
pip install cutensor-cuXX
where XX is the CUDA major version (currently CUDA 11 & 12 are supported). The package cutensor (without the -cuXX suffix) is deprecated. If you have cutensor installed, please remove it prior to installing cutensor-cuXX.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distributions
Hashes for cutensor_cu12-1.7.0-py3-none-win_amd64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | e1a9a759a615a64d1b8c6d2b8ffd925deb805750c28481b1a8310d05f35ce229 |
|
MD5 | 318d67a255ae35b0d7d7a9fc56d2e62d |
|
BLAKE2b-256 | ed8107d73da9c7f1c19e7acc748e5b9c27d37c1ec544225937f67a82dc94499b |
Hashes for cutensor_cu12-1.7.0-py3-none-manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 29bdde551788fd3a611992026a5bb422831069e38fd44ab920af5aa00cffa12c |
|
MD5 | 714780b13f6093c62e8cb5cf8b046cb2 |
|
BLAKE2b-256 | 604281bdc77ba8fe05357e20baf274bd26e8b6774fd6554dd07958f526bf8059 |
Hashes for cutensor_cu12-1.7.0-py3-none-manylinux2014_aarch64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 515caa2406e09ffe9c6524328b7da2106169811665f7684836052753a30dda27 |
|
MD5 | 8cf9fe7bf34b143910bbed1948dce7b3 |
|
BLAKE2b-256 | 91f2fccc9d610bc92b0ca050ecaede0e16b78cd514825511e00654ecda6a45f4 |