Skip to main content

Data sampler from streaming data

Project description

StreamSampler

StreamSampler package allows you to sample a particular number of elements from a stream of data of which length is very large or unknown.

StreamSampler is provided in both forms of an executable command and library. It utilizes Reservoir sampling algorithm [Vitter85]

You can take a look at the README.txt of other projects, such as repoze.bfg (http://bfg.repoze.org/trac/browser/trunk/README.txt) for some ideas.

License

MIT License

See Also

  • sample-cli by Paul Butler is a command line tool providing almost the same feature. StreamSampler is intended to be a library, although it has a command line interface, so that it can be a part of other packages including my future projects.

News

0.1.0

First public version

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

StreamSampler-0.1.0.tar.gz (4.2 kB view details)

Uploaded Source

File details

Details for the file StreamSampler-0.1.0.tar.gz.

File metadata

File hashes

Hashes for StreamSampler-0.1.0.tar.gz
Algorithm Hash digest
SHA256 7bea32c7d2ee6b0e08f4df5e06291681789bdc1ecc07ff9c65100d55fd85dd4c
MD5 5bebb155c7218a98474d1d7928fe79cf
BLAKE2b-256 aef53cc7d80103c3427480bf125e657c8b38dace4340e8e969859e98d53a019b

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page