Skip to main content

Recursive descent parsing library based on functional combinators

Project description

funcparserlib

Recursive descent parsing library for Python based on functional combinators.

PyPI PyPI - Downloads

Description

Parser combinators are just higher-order functions that take parsers as their arguments and return them as result values. Parser combinators are:

  • First-class values
  • Extremely composable
  • Tend to make the code quite compact
  • Resemble the readable notation of xBNF grammars

Parsers made with funcparserlib are pure-Python LL(*) parsers. It means that it's very easy to write them without thinking about lookaheads and all that hardcore parsing stuff. However, the recursive descent parsing is a rather slow method compared to LL(k) or LR(k) algorithms.

So the primary domain for funcparserlib is parsing little languages or external DSLs (domain specific languages).

The library itself is very small. Its source code is only 600 lines of code, with lots of comments included. It features the longest parsed prefix error reporting, as well as a tiny lexer generator for token position tracking.

Show Me the Code

This is an excerpt from a JSON parser (RFC 4627) written using funcparserlib. This full example as well as others can be found here.

def parse(seq):
    """Sequence(Token) -> object"""
    ...
    n = lambda s: tok("Name", s)
    def make_array(values):
        if values is None:
            return []
        else:
            return [values[0]] + values[1]
    ...
    null = n("null") >> const(None)
    true = n("true") >> const(True)
    false = n("false") >> const(False)
    number = tok("Number") >> make_number
    string = tok("String") >> make_string
    value = forward_decl()
    member = string + -op(":") + value >> tuple
    object = (
        -op("{") +
        maybe(member + many(-op(",") + member)) +
        -op("}")
        >> make_object)
    array = (
        -op("[") +
        maybe(value + many(-op(",") + value)) +
        -op("]")
        >> make_array)
    value.define(
          null
        | true
        | false
        | object
        | array
        | number
        | string)
    json_text = object | array
    json_file = json_text + -finished

    return json_file.parse(seq)

Installation

You can install the funcparserlib library from PyPI via pip:

$ pip install funcparserlib

There are no dependencies on other libraries.

Documentation

See also comments inside the modules funcparserlib.parser and funcparserlib.lexer or generate the API docs from the modules using pydoc.

There a couple of examples available in the tests/ directory:

See also the changelog and FAQ.

Performance and Code Size

Despite being an LL(*) parser, funcparserlib has a reasonable performance. For example, a JSON parser written using funcparserlib is 3 times faster than a parser using the popular pyparsing library and only 5 times slower than the specialized JSON library simplejson that uses ad hoc parsing. Here are some stats:

File Size cjson simplejson funcparserlib json-ply pyparsing
6 KB 0 ms 45 ms 228 ms n/a 802 ms
11 KB 0 ms 80 ms 395 ms 367 ms 1355 ms
100 KB 4 ms 148 ms 855 ms 1071 ms 2611 ms
134 KB 11 ms 957 ms 4775 ms n/a 16534 ms
1009 KB 87 ms 6904 ms 36826 ms n/a 116510 ms
User Code 0.9 KLOC 0.8 KLOC 0.1 KLOC 0.5 KLOC 0.1 KLOC
Library Code 0 KLOC 0 KLOC 0.5 KLOC 5.3 KLOC 3.7 KLOC

Both funcparserlib and pyparsing have the smallest user code size (that is a common feature of parsing libraries compared to ad hoc parsers). The library code of funcparserlib is 7 times smaller (and much cleaner) than pyparsing. The json-ply uses a LALR parser ply (similar to Yacc) and performs like funcparserlib. cjson is a C library, hence the incredible performance :)

Used By

Some open-source projects that use funcparserlib as an explicit dependency:

Usages in tests / secondary dependencies:

Similar Projects

  • LEPL. A recursive descent parsing library that uses two-way generators for backtracking. Its source code is rather large: 17 KLOC.
  • pyparsing. A recursive descent parsing library. Probably the most popular Python parsing library. Nevertheless, its source code is quite dirty (though 4 KLOC only).
  • Monadic Parsing in Python. A series of blog entries on monadic parsing.
  • Pysec (aka Parsec in Python). A blog entry on monadic parsing, with nice syntax for Python.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

funcparserlib-1.0.0a0.tar.gz (17.7 kB view details)

Uploaded Source

Built Distribution

funcparserlib-1.0.0a0-py2.py3-none-any.whl (17.8 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file funcparserlib-1.0.0a0.tar.gz.

File metadata

  • Download URL: funcparserlib-1.0.0a0.tar.gz
  • Upload date:
  • Size: 17.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.2 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.2 CPython/3.9.7

File hashes

Hashes for funcparserlib-1.0.0a0.tar.gz
Algorithm Hash digest
SHA256 e6fb432a9276078eee71813a59089fd8c4f3e81f183b4ce8b8b4734d0b2faa2a
MD5 c6ec30bfa49b7012a704784c3a27c247
BLAKE2b-256 75e2a132f2b624a2bad2c3233afa7c8273ee192d0660b55dbcb22f68ff9de819

See more details on using hashes here.

File details

Details for the file funcparserlib-1.0.0a0-py2.py3-none-any.whl.

File metadata

  • Download URL: funcparserlib-1.0.0a0-py2.py3-none-any.whl
  • Upload date:
  • Size: 17.8 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.2 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.2 CPython/3.9.7

File hashes

Hashes for funcparserlib-1.0.0a0-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 a5ca025aaec28ae19956556260f5e0f28b0d851863e7982a0daf238161e33752
MD5 eadb51dfedaeb1c30af691d499f2aab2
BLAKE2b-256 3d2d575bc3d05cf81634ade9c3e10a89c7c48fccaabe4346b347d9e677e3295d

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page