cutensor-cu11

NVIDIA cuTENSOR

These details have been verified by PyPI

Maintainers

JeremyWangNVDA leofang mhohnerbach_nvidia mtjrider

These details have not been verified by PyPI

Project links

Homepage

Project description

cuTENSOR is a high-performance CUDA library for tensor primitives.

Key Features

Extensive mixed-precision support:
- FP64 inputs with FP32 compute.
- FP32 inputs with FP16, BF16, or TF32 compute.
- Complex-times-real operations.
- Conjugate (without transpose) support.
Support for up to 64-dimensional tensors.
Arbitrary data layouts.
Trivially serializable data structures.
Main computational routines:
- Direct (i.e., transpose-free) tensor contractions.
  - Support just-in-time compilation of dedicated kernels.
- Tensor reductions (including partial reductions).
- Element-wise tensor operations:
  - Support for various activation functions.
  - Support for padding of the output tensor
  - Arbitrary tensor permutations.
  - Conversion between different data types.

Documentation

Please refer to https://docs.nvidia.com/cuda/cutensor/index.html for the cuTENSOR documentation.

Installation

The cuTENSOR wheel can be installed as follows:

pip install cutensor-cuXX

where XX is the CUDA major version (currently CUDA 11 & 12 are supported). The package cutensor (without the -cuXX suffix) is deprecated. If you have cutensor installed, please remove it prior to installing cutensor-cuXX.

Project details

These details have been verified by PyPI

Maintainers

JeremyWangNVDA leofang mhohnerbach_nvidia mtjrider

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

2.0.2

Jul 9, 2024

2.0.1

Feb 8, 2024

This version

2.0.0

Nov 21, 2023

1.7.0

Apr 6, 2023

1.6.2

Jan 5, 2023

1.6.1

Dec 1, 2022

1.6.0

Nov 14, 2022

0.0.1.dev0 pre-release yanked

Oct 7, 2022

Reason this release was yanked:

placeholder

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

cutensor_cu11-2.0.0-py3-none-win_amd64.whl (137.9 MB view hashes)

Uploaded Nov 21, 2023 Python 3 Windows x86-64

cutensor_cu11-2.0.0-py3-none-manylinux2014_x86_64.whl (143.8 MB view hashes)

Uploaded Nov 21, 2023 Python 3

cutensor_cu11-2.0.0-py3-none-manylinux2014_aarch64.whl (143.6 MB view hashes)

Uploaded Nov 21, 2023 Python 3

Hashes for cutensor_cu11-2.0.0-py3-none-win_amd64.whl

Hashes for cutensor_cu11-2.0.0-py3-none-win_amd64.whl
Algorithm	Hash digest
SHA256	`d82d66cac2b6e9264a38789e1dda978e5071c3da448b75427592be281ab957aa`
MD5	`7933741ee61dbe1fe3f9a9fbaa04952e`
BLAKE2b-256	`15c01cb0db3d17a1ff2c5a158dab9c481c75c7e6080dfb20f2a38de72c4dd403`

Hashes for cutensor_cu11-2.0.0-py3-none-manylinux2014_x86_64.whl

Hashes for cutensor_cu11-2.0.0-py3-none-manylinux2014_x86_64.whl
Algorithm	Hash digest
SHA256	`d43c6a4cc74bd167913689b16b654ef0642b4ae39adb2ee11eb4fb46e1e3fb29`
MD5	`972d84b7bebb55c0516e1d33728730ed`
BLAKE2b-256	`da3b01e07fa45cc30012b560401883522960baa99c4f3f2ccdeba4a2bfa84980`

Hashes for cutensor_cu11-2.0.0-py3-none-manylinux2014_aarch64.whl

Hashes for cutensor_cu11-2.0.0-py3-none-manylinux2014_aarch64.whl
Algorithm	Hash digest
SHA256	`8106b70be277759154b9a889342183b29c418bf4154c81b15b9750e99d838918`
MD5	`1bc2b5de2dcf4ff88bc2d5bb19f2c1ef`
BLAKE2b-256	`14f943888d33378e045e5ee83a0860b9201fddd15523f362535b706af7a75d87`