Skip to main content

Parses date/time from paths using glob wildcard pattern intertwined with date/time format

Project description

datetime-glob

Parses date/times from a path given a glob pattern intertwined with date/time format akin to strptime/strftime format.

datetime.datetime.strptime suffices for simple date/time parsing. However, as soon as you need to handle wildcards, it becomes tricky and you need to resort to regular expressions.

We found the glob patterns and strptime format to be far easier to read and write than regular expressions, and encapsulated the logic involving regular expressions in this module.

Installation

  • Create a virtual environment:

python3 -m venv venv3
  • Activate it:

source venv3/bin/activate
  • Install datetime-glob with pip:

pip3 install datetime-glob

Usage

To match a path:

>>> import datetime_glob
>>> matcher = datetime_glob.Matcher(pattern='/some/path/*%Y-%m-%dT%H-%M-%SZ.jpg')
>>> matcher.match(path='/some/path/some-text2016-07-03T21-22-23Z.jpg')
datetime_glob.Match(year = 2016, month = 7, day = 3, hour = 21, minute = 22, second = 23, microsecond = None)

>>> match.as_datetime()
datetime.datetime(2016, 7, 3, 21, 22, 23)
>>> match.as_date()
datetime.date(2016, 7, 3)
>>> match.as_time()
datetime.time(21, 22, 23)

If you specify a directive for the same field twice, the matcher will make sure that the field has the same semantical value in order to match:

>>> import datetime_glob
>>> matcher = datetime_glob.Matcher(pattern='/some/path/%y/%Y-%m-%d.txt')

>>> match = matcher.match(path='/some/path/16/2016-07-03.txt')
>>> match
datetime_glob.Match(year = 2016, month = 7, day = 3, hour = None, minute = None, second = None, microsecond = None)

>>> match = matcher.match(path='/some/path/19/2016-07-03.txt')
>>> type(match)
<class 'NoneType'>

To iterate over a tree, and match incrementally each path segment by yourself:

>>> import datetime_glob
>>> pattern_segments = datetime_glob.parse_pattern(pattern='/some/path/*%Y/%m/%d/%H-%M-%SZ.jpg')
>>> match = datetime_glob.Match()

>>> match=datetime_glob.match_segment(segment='some', pattern_segment=pattern_segments[0], match=match)
>>> match
datetime_glob.Match(year = None, month = None, day = None, hour = None, minute = None, second = None,
                    microsecond = None)

>>> match=datetime_glob.match_segment(segment='path', pattern_segment=pattern_segments[1], match=match)
>>> match
datetime_glob.Match(year = None, month = None, day = None, hour = None, minute = None, second = None,
                    microsecond = None)

>>> match=datetime_glob.match_segment(segment='some-text2016', pattern_segment=pattern_segments[2], match=match)
>>> match
datetime_glob.Match(year = 2016, month = None, day = None, hour = None, minute = None, second = None,
                    microsecond = None)

>>> match=datetime_glob.match_segment(segment='07', pattern_segment=pattern_segments[3], match=match)
>>> match
datetime_glob.Match(year = 2016, month = 7, day = None, hour = None, minute = None, second = None,
                    microsecond = None)

>>> match=datetime_glob.match_segment(segment='03', pattern_segment=pattern_segments[4], match=match)
>>> match
datetime_glob.Match(year = 2016, month = 7, day = 3, hour = None, minute = None, second = None, microsecond = None)

>>> match=datetime_glob.match_segment(segment='21-22-23Z.jpg', pattern_segment=pattern_segments[5], match=match)
>>> match
datetime_glob.Match(year = 2016, month = 7, day = 3, hour = 21, minute = 22, second = 23, microsecond = None)

Supported strftime directives

(subset from https://docs.python.org/3/library/datetime.html#strftime-and-strptime-behavior)

Directive

Meaning

Example

%d

Day of the month as a zero-padded decimal number.

01, 02, …, 31

%-d

Day of the month as a decimal number.

1, 2, …, 31

%m

Month as a zero-padded decimal number.

01, 02, …, 12

%-m

Month as a decimal number.

1, 2, …, 12

%y

Year without century as a zero-padded decimal number.

00, 01, …, 99

%Y

Year with century as a decimal number.

1970, 1988, 2001, 2013

%H

Hour (24-hour clock) as a zero-padded decimal number.

00, 01, …, 23

%-H

Hour (24-hour clock) as a decimal number.

0, 1, …, 23

%M

Minute as a zero-padded decimal number.

00, 01, …, 59

%-M

Minute as a decimal number.

0, 1, …, 59

%S

Second as a zero-padded decimal number.

00, 01, …, 59

%-S

Second as a decimal number.

0, 1, …, 59

%f

Microsecond as a decimal number, zero-padded on the left.

000000, 000001, …, 999999

%%

A literal ‘%’ character.

%

Development

  • Check out the repository.

  • In the repository root, create the virtual environment:

python3 -m venv venv3
  • Activate the virtual environment:

source venv3/bin/activate
  • Install the development dependencies:

pip3 install -e .[dev]
  • Run precommit.py to execute pre-commit checks locally.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

datetime-glob-1.0.5.tar.gz (6.8 kB view details)

Uploaded Source

File details

Details for the file datetime-glob-1.0.5.tar.gz.

File metadata

File hashes

Hashes for datetime-glob-1.0.5.tar.gz
Algorithm Hash digest
SHA256 3fe0f5b077867c997acbfffe43a11df67b7b01d8af21e27d5fe102df7a6a3ed7
MD5 0aa69bce5ffeb9436e26738afd07719a
BLAKE2b-256 f1fadb604da55b25037f99a3cadbc1ca06bf5d8977443d18a0c8e3f9186b0841

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page