Skip to main content

Powerful data structures for data analysis and statistics

Project description

pandas is a Python package providing fast, flexible, and expressive data structures designed to make working with “relational” or “labeled” data both easy and intuitive. It aims to be the fundamental high-level building block for doing practical, real world data analysis in Python. Additionally, it has the broader goal of becoming the most powerful and flexible open source data analysis / manipulation tool available in any language. It is already well on its way toward this goal.

pandas is well suited for many different kinds of data:

  • Tabular data with heterogeneously-typed columns, as in an SQL table or Excel spreadsheet

  • Ordered and unordered (not necessarily fixed-frequency) time series data.

  • Arbitrary matrix data (homogeneously typed or heterogeneous) with row and column labels

  • Any other form of observational / statistical data sets. The data actually need not be labeled at all to be placed into a pandas data structure

The two primary data structures of pandas, Series (1-dimensional) and DataFrame (2-dimensional), handle the vast majority of typical use cases in finance, statistics, social science, and many areas of engineering. For R users, DataFrame provides everything that R’s data.frame provides and much more. pandas is built on top of NumPy and is intended to integrate well within a scientific computing environment with many other 3rd party libraries.

Here are just a few of the things that pandas does well:

  • Easy handling of missing data (represented as NaN) in floating point as well as non-floating point data

  • Size mutability: columns can be inserted and deleted from DataFrame and higher dimensional objects

  • Automatic and explicit data alignment: objects can be explicitly aligned to a set of labels, or the user can simply ignore the labels and let Series, DataFrame, etc. automatically align the data for you in computations

  • Powerful, flexible group by functionality to perform split-apply-combine operations on data sets, for both aggregating and transforming data

  • Make it easy to convert ragged, differently-indexed data in other Python and NumPy data structures into DataFrame objects

  • Intelligent label-based slicing, fancy indexing, and subsetting of large data sets

  • Intuitive merging and joining data sets

  • Flexible reshaping and pivoting of data sets

  • Hierarchical labeling of axes (possible to have multiple labels per tick)

  • Robust IO tools for loading data from flat files (CSV and delimited), Excel files, databases, and saving / loading data from the ultrafast HDF5 format

  • Time series-specific functionality: date range generation and frequency conversion, moving window statistics, moving window linear regressions, date shifting and lagging, etc.

Many of these principles are here to address the shortcomings frequently experienced using other languages / scientific research environments. For data scientists, working with data is typically divided into multiple stages: munging and cleaning data, analyzing / modeling it, then organizing the results of the analysis into a form suitable for plotting or tabular display. pandas is the ideal tool for all of these tasks.

Note

Windows binaries built against NumPy 1.6.1

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pandas-0.5.0.tar.gz (1.6 MB view details)

Uploaded Source

Built Distributions

pandas-0.5.0.win-amd64-py3.2.exe (826.1 kB view details)

Uploaded Source

pandas-0.5.0.win-amd64-py3.1.exe (825.0 kB view details)

Uploaded Source

pandas-0.5.0.win-amd64-py2.7.exe (789.1 kB view details)

Uploaded Source

pandas-0.5.0.win-amd64-py2.6.exe (789.0 kB view details)

Uploaded Source

pandas-0.5.0.win32-py3.2.exe (692.9 kB view details)

Uploaded Source

pandas-0.5.0.win32-py3.1.exe (692.8 kB view details)

Uploaded Source

pandas-0.5.0.win32-py2.7.exe (688.9 kB view details)

Uploaded Source

pandas-0.5.0.win32-py2.6.exe (688.6 kB view details)

Uploaded Source

pandas-0.5.0.win32-py2.5.exe (552.4 kB view details)

Uploaded Source

File details

Details for the file pandas-0.5.0.tar.gz.

File metadata

  • Download URL: pandas-0.5.0.tar.gz
  • Upload date:
  • Size: 1.6 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for pandas-0.5.0.tar.gz
Algorithm Hash digest
SHA256 2e4eafc80cea5254cf8cea40761b65e8380bfc151f56b075bc97c081d9666a2c
MD5 2e02666b380d83dc86bebe17ef59c1c6
BLAKE2b-256 7f31a9512b17c946c5b18b170d365fff27aa9cbfea84c3d2a26c2787e2a1f83b

See more details on using hashes here.

File details

Details for the file pandas-0.5.0.win-amd64-py3.2.exe.

File metadata

File hashes

Hashes for pandas-0.5.0.win-amd64-py3.2.exe
Algorithm Hash digest
SHA256 4bc00214aa1f4ba223c990d124d14def5e0be955d2b979e3566ac3e1c55bdf84
MD5 16ff9648e2ce5bf9192003b65bef4448
BLAKE2b-256 d8fea32cf083c5cabf80a136786c29d24537997ce65a45391dab70edc1aaca1f

See more details on using hashes here.

File details

Details for the file pandas-0.5.0.win-amd64-py3.1.exe.

File metadata

File hashes

Hashes for pandas-0.5.0.win-amd64-py3.1.exe
Algorithm Hash digest
SHA256 d30a1d6d5cdc93a150386240150989d841af5cec3398282fe4ad1b0dd98273a1
MD5 5be49b4602f48d27c913446d7eccde5b
BLAKE2b-256 1e677ba2714605fc7f52a2e836bb1e82b4c42832566719ae28ee4f890e0c139d

See more details on using hashes here.

File details

Details for the file pandas-0.5.0.win-amd64-py2.7.exe.

File metadata

File hashes

Hashes for pandas-0.5.0.win-amd64-py2.7.exe
Algorithm Hash digest
SHA256 d186947bf8e17a8a4b1dd2bbced1a0b0ab43d36f37cf39f0311bd47cc6a05125
MD5 0c7851374b5d4b7d79d12c2d5b60575b
BLAKE2b-256 4f98ce1804233d2c5bdcc1d7b1ae85f1aecb19d9d4bfc2fb400acf150c935ad3

See more details on using hashes here.

File details

Details for the file pandas-0.5.0.win-amd64-py2.6.exe.

File metadata

File hashes

Hashes for pandas-0.5.0.win-amd64-py2.6.exe
Algorithm Hash digest
SHA256 6ffa6259ec323dd13790031a7ee45d7fe4232acdcd4e04a093358c1b61ed7c14
MD5 39f147ed8616657357bdcde736ef2f5c
BLAKE2b-256 213907731c1511f0cf7adf2f852467c13a1e674712f1df90c650b73f6f2aaa62

See more details on using hashes here.

File details

Details for the file pandas-0.5.0.win32-py3.2.exe.

File metadata

File hashes

Hashes for pandas-0.5.0.win32-py3.2.exe
Algorithm Hash digest
SHA256 144884bca339fb89ab5786a9c8500022066baae5ec8dc3b53e5e3392c680e89b
MD5 41f25e627c3fbbd2b806c291a603c8ab
BLAKE2b-256 01a8a18325e06aa1b031fa562aee57a6a1c6f22ace38b50f6832a8240cb78731

See more details on using hashes here.

File details

Details for the file pandas-0.5.0.win32-py3.1.exe.

File metadata

File hashes

Hashes for pandas-0.5.0.win32-py3.1.exe
Algorithm Hash digest
SHA256 33dd1a55f6ac3db1dad5f3a37df69189c43f1deefd5e57b193f2018e42d68aa7
MD5 620829d6ef2f9b47ddcc29afb5618758
BLAKE2b-256 e8b3b469acae85fb7d20de1bde65856afefd1a768efef8136d222ef1ff70dd8b

See more details on using hashes here.

File details

Details for the file pandas-0.5.0.win32-py2.7.exe.

File metadata

File hashes

Hashes for pandas-0.5.0.win32-py2.7.exe
Algorithm Hash digest
SHA256 533c314d96bdf9b0cff7b0cf85155b25719e6724ed2992ee5225b6667a69f0e2
MD5 c2badf1d82d48a57abcff72228d28cd9
BLAKE2b-256 4b4a999de1e736c1e6960a0ff5225044d5f1b5c033387996c585772c0b74818c

See more details on using hashes here.

File details

Details for the file pandas-0.5.0.win32-py2.6.exe.

File metadata

File hashes

Hashes for pandas-0.5.0.win32-py2.6.exe
Algorithm Hash digest
SHA256 f7e8811288d5dbc9cc1837a95af1674ab66cb49560050fdc29e40ec44d2a9424
MD5 7760fe59b2bdc51104e04868102936ce
BLAKE2b-256 2d4cc2d8fd0af43654c306b8a9f794f5dc0799baaec9b6e1cd8bfeb0e0e40531

See more details on using hashes here.

File details

Details for the file pandas-0.5.0.win32-py2.5.exe.

File metadata

File hashes

Hashes for pandas-0.5.0.win32-py2.5.exe
Algorithm Hash digest
SHA256 43a858cd43f633498bbe9df6147a9cf5104a6358bfe557e098b1f4aba5b7fa9f
MD5 f09e328841a9ba0482c26271a0631ae2
BLAKE2b-256 1fde58ebda3b7525687b39a8afd85a82d19f33ab0c772c63a833a9cd4d215b4d

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page