Python Package for riptable studies framework

These details have not been verified by PyPI

Project links

Development Status
- 4 - Beta
License
- OSI Approved :: BSD License
Operating System
- OS Independent
Programming Language

Project description

Riptable

An open-source, 64-bit Python analytics engine for high-performance data analysis with multithreading support. Riptable supports Python 3.9 through 3.11 on 64-bit Linux and Windows.

Similar to Pandas and based on NumPy, Riptable optimizes analyzing large volumes of data interactively, in real time. Riptable can crunch numbers often at 1.5x to 10x the speed of NumPy or Pandas.

Riptable achieves maximum speed through the use of:

Vector instrinsics with hand-rolled loops using AVX-256 and with AVX-512 support coming.
Parallel computing with multiple-thread deployment for large arrays.
Recycling with built-in array garbage collection.
Hashing and parallel sorts for core algorithms.

Intro to Riptable and reference documentation is available at: riptable.readthedocs.io

Basic concepts and classes

FastArray is a subclass of NumPy's ndarray that enables built-in multithreaded number crunching. All Scikit routines that expect a NumPy array also accept a FastArray.

Dataset replaces the Pandas DataFrame class and holds NumPy arrays of equal length.

Struct holds a collection of mixed-type data members, with Dataset as a subclass.

Categorical replaces both the Pandas DataFrame.groupby() method and the Pandas Categorical class. A Riptable Categorical supports multi-key, filterable groupings with the same functionality of Pandas groupby and more.

Datetime classes replace most NumPy and Pandas date/time classes. Riptable's DateTimeNano, Date, TimeSpan, and DateSpan classes have a design that's closer to Java, C++, or C# date/time classes.

Accum2 and AccumTable enable cross-tabulation functionality.

SDS provides a new file format which can stack multiple datasets in multiple files with zstd compression, threads, and no extra memory copies.

Small, medium, and large array performance

Riptable is designed for arrays of all sizes. For small arrays (< 100 length), low processing overhead is important. Riptable's FastArray is written in hand-coded C and processes simple arithmetic functions faster than NumPy arrays. For medium arrays (< 100,000 length), Riptable has vector-instrinic loops. For large arrays (>= 100,000) Riptable knows how to dynamically scale out threading, waking up threads efficiently using a futex.

Install and import Riptable

Create a Conda environment and run this command to install Riptable on Windows or Linux:

conda install riptable

Import Riptable in your Python code to access its functions, methods, and classes:

import riptable as rt

Note: We shorten the name of the Riptable module to rt to improve the readability of code.

Use NumPy arrays with Riptable

Easily change between NumPy's ndarray and Riptable's FastArray without producing a copy of the array.

import riptable as rt
import numpy as np
rtarray = rt.arange(100)
numpyarray = rtarray._np
fastarray = rt.FastArray(numpyarray)

Change the view of the two instances to confirm that FastArray is a subclass of ndarray.

numpyarray.view(rt.FastArray)
fastarray.view(np.ndarray)
isinstance(fastarray, np.ndarray)

Use Pandas DataFrames with Riptable

Construct a Riptable Dataset directly from a Pandas DataFrame.

import riptable as rt
import numpy as np
import pandas as pd
df = pd.DataFrame({"intarray": np.arange(1_000_000), "floatarray": np.arange(1_000_000.0)})
ds = rt.Dataset(df)

How can I trust Riptable calculations?

Riptable has undergone years of development, and dozens of quants at a large financial firm have tested its capabilities. We also provide a full suite of tests to ensure that the modules are functioning as expected. But as with any project, there are still bugs and opportunities for improvement, which can be reported using GitHub issues.

How can Riptable perform calculations faster?

Riptable was written from day one to handle large data and multithreading using the riptide_cpp layer for basic arithmetic functions and algorithms. Many core algorithms have been painstakingly rewritten for multithreading.

How can I contribute?

The Riptable engine is another building block for Python data analytics computing, and we welcome help from users and contributors to take it to the next level. As you encounter bugs, issues with the documentation, and opportunities for new or improved functionality, please consider reaching out to the team.

See the contributing guide for more information.

Project details

These details have not been verified by PyPI

Project links

Development Status
- 4 - Beta
License
- OSI Approved :: BSD License
Operating System
- OS Independent
Programming Language

Release history Release notifications | RSS feed

1.17.1

Apr 25, 2024

1.17.0

Apr 23, 2024

1.16.1

Apr 9, 2024

1.16.0

Mar 27, 2024

1.15.0

Mar 7, 2024

1.14.5

Feb 7, 2024

1.14.4

Jan 31, 2024

1.14.3

Jan 4, 2024

1.14.2

Dec 15, 2023

1.14.1

Dec 1, 2023

1.14.0

Nov 2, 2023

1.13.4

Oct 5, 2023

1.13.3

Oct 5, 2023

This version

1.13.2

Sep 27, 2023

1.13.1

Sep 12, 2023

1.13.0

Aug 31, 2023

1.12.0

Aug 16, 2023

1.11.0

Aug 1, 2023

1.10.0

Jul 24, 2023

1.9.2

Jul 12, 2023

1.9.1

Jun 22, 2023

1.9.0

Jun 14, 2023

1.8.1

May 26, 2023

1.8.0

May 18, 2023

1.7.0

May 9, 2023

1.6.11

Apr 18, 2023

1.6.10

Apr 13, 2023

1.6.9

Apr 5, 2023

1.6.8

Mar 28, 2023

1.6.7

Mar 7, 2023

1.6.6

Mar 6, 2023

1.6.5

Feb 22, 2023

1.6.4

Jan 25, 2023

1.6.3

Jan 17, 2023

1.6.2

Jan 9, 2023

1.6.1

Dec 21, 2022

1.6.0

Dec 15, 2022

1.5.1

Dec 1, 2022

1.5.0

Nov 9, 2022

1.4.2

Oct 27, 2022

1.4.1

Oct 21, 2022

1.4.0

Oct 18, 2022

1.3.6

Aug 24, 2022

1.3.5

Apr 18, 2022

1.3.4

Apr 13, 2022

1.3.3

Mar 11, 2022

1.3.2

Mar 9, 2022

1.3.1

Feb 23, 2022

1.2.9

Jan 31, 2022

1.2.8

Jan 19, 2022

1.2.7

Jan 19, 2022

1.2.6

Dec 23, 2021

1.2.5

Dec 15, 2021

1.2.4

Dec 8, 2021

1.2.3

Dec 8, 2021

1.2.2

Dec 2, 2021

1.2.1

Nov 18, 2021

1.2.0

Nov 16, 2021

1.1.4

Oct 22, 2021

1.1.3

Oct 8, 2021

1.1.2

Oct 6, 2021

1.1.1

Oct 1, 2021

1.1.0

Aug 11, 2021

1.0.58

Aug 11, 2021

1.0.57

Jul 13, 2021

1.0.56

Jul 2, 2021

1.0.55

Jun 29, 2021

1.0.54

Jun 3, 2021

1.0.53

Jun 1, 2021

1.0.42

Jan 20, 2021

1.0.41

Jan 11, 2021

1.0.40

Dec 22, 2020

1.0.39

Dec 21, 2020

1.0.38

Dec 10, 2020

1.0.37

Dec 10, 2020

1.0.36

Dec 9, 2020

1.0.35

Dec 4, 2020

1.0.34

Dec 4, 2020

1.0.33

Dec 3, 2020

1.0.32

Nov 30, 2020

1.0.31

Nov 27, 2020

1.0.29

Nov 25, 2020

1.0.27

Nov 23, 2020

1.0.26

Nov 17, 2020

1.0.25

Oct 22, 2020

1.0.24

Oct 20, 2020

1.0.19

Sep 22, 2020

1.0.17

Sep 18, 2020

1.0.15

Sep 17, 2020

1.0.11

Sep 9, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

riptable-1.13.2.tar.gz (1.6 MB view details)

Uploaded Sep 27, 2023 Source

File details

Details for the file riptable-1.13.2.tar.gz.

File metadata

Download URL: riptable-1.13.2.tar.gz
Upload date: Sep 27, 2023
Size: 1.6 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.11.5

File hashes

Hashes for riptable-1.13.2.tar.gz
Algorithm	Hash digest
SHA256	`93a993baba0af107e4a7a825008f1ccb0e2f8dd9c261481c5e5d6c6d3b3a6c93`
MD5	`6b0c9a02b13938578599d822ba5bdb0d`
BLAKE2b-256	`cecae8cb2da4fe0793d1e0c40ddeff1f7ccf0698056ce374e7f5449989bd194f`