Skip to main content

Various data and utilities for processing wikitext.

Project description

mwconstants

Various utilities and constants useful for analyses of wikitext. This package contains three types of artifacts:

  • Data generating functions: Python functions for calling various APIs to build useful data structures -- e.g., all Wikipedia language codes
  • Static data snapshots: Python variables that contain the most recent result of a data generating function
  • Utilities: Python functions for handling various wikitext-related processing tasks -- e.g., mapping links to namespaces.

Installation

You can install mwconstants with pip:

   $ pip install mwconstants

Basic Usage

from mwconstants import link_to_namespace, NON_WHITESPACE_LANGUAGES

print(link_to_namespace('Utilisateur:Isaac_(WMF)', lang='fr'))  # 'User'
print(sorted(NON_WHITESPACE_LANGUAGES))  # ['bo', 'bug', ..., 'zh-classical', 'zh-yue']

Modules

All modules generally contain relevant constants, functions for generating those constants, and other useful utilities for manipulating them:

  • languages.py: functions for identifying languages associated with a given Wikimedia project.
  • media.py: functions for identifying media in wikitext and parsing wikitext media syntax into its components
  • namespaces.py: functions for identifying namespace prefixes

Limitations

  • Links have many edge-cases, especially around interwiki prefixes. For now, just the basics are covered: language-specific namespaces and interlanguage links

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mwconstants-0.1.0.tar.gz (90.5 kB view details)

Uploaded Source

Built Distribution

mwconstants-0.1.0-py3-none-any.whl (93.9 kB view details)

Uploaded Python 3

File details

Details for the file mwconstants-0.1.0.tar.gz.

File metadata

  • Download URL: mwconstants-0.1.0.tar.gz
  • Upload date:
  • Size: 90.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.1

File hashes

Hashes for mwconstants-0.1.0.tar.gz
Algorithm Hash digest
SHA256 81cd90b1080e13053e55326856c1b78156c9f8612a403f0095c4f1a0b1488415
MD5 4b32d6bbbcbc452c8ebcb06b4c01a5a5
BLAKE2b-256 d585a5416773f5a33feeffd7b9b78992e07a78b7e33b2b57a448e66eeebd3c68

See more details on using hashes here.

Provenance

File details

Details for the file mwconstants-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: mwconstants-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 93.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.1

File hashes

Hashes for mwconstants-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 3023602ed465a5428e59d5a5fc51b0151acb4fc6246714d572f542d9efbb0786
MD5 e0026009610924fd3d884bdbe0f04156
BLAKE2b-256 0d19e60761e20dd4e7609c76e126ba415b03de11c172dcc8290322dd8c75e069

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page