A code generator for array-based code on CPUs and GPUs
Project description
Loopy lets you easily generate the tedious, complicated code that is necessary to get good performance out of GPUs and multi-core CPUs. Loopy’s core idea is that a computation should be described simply and then transformed into a version that gets high performance. This transformation takes place under user control, from within Python.
It can capture the following types of optimizations:
Vector and multi-core parallelism in the OpenCL/CUDA model
Data layout transformations (structure of arrays to array of structures)
Loop unrolling
Loop tiling with efficient handling of boundary cases
Prefetching/copy optimizations
Instruction level parallelism
and many more
Loopy targets array-type computations, such as the following:
dense linear algebra,
convolutions,
n-body interactions,
PDE solvers, such as finite element, finite difference, and Fast-Multipole-type computations
It is not (and does not want to be) a general-purpose programming language.
Loopy is licensed under the liberal MIT license and free for commercial, academic, and private use. All of Loopy’s dependencies can be automatically installed from the package index after using:
pip install loopy
In addition, Loopy is compatible with and enhances pyopencl.
—
Places on the web related to Loopy:
Python package index (download releases)
Documentation (read how things work)
Github (get latest source code, file bugs)
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file loopy-2020.2.1.tar.gz
.
File metadata
- Download URL: loopy-2020.2.1.tar.gz
- Upload date:
- Size: 483.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/44.0.0 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.8.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8726ff3b0396301e26cf61e855bf8d0c19ab3ff4c4ad705edeae4eeef081c081 |
|
MD5 | 730da68bf4fb9428b915cdc80907fa65 |
|
BLAKE2b-256 | 6aab9eb8884dff0dba27105c513e0e551df5b9461eb356a7523a6cbb2e8b7dba |