Skip to main content

Cross-language UserAgent classifier library, python implementation

Project description

travis-ci.org coveralls.io latest version license

The Python implementation of Project Woothee, which is multi-language user-agent strings parsers.

https://github.com/woothee/woothee

Installation

$ pip install woothee

Usage

Parsing user-agent

import woothee
woothee.parse("Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 6.1; Trident/4.0)")
# => {'name': 'Internet Explorer', 'category': 'pc', 'os': 'Windows 7', 'version': '8.0', 'vendor': 'Microsoft', 'os_version': 'NT 6.1'}

Parse user-agent string and returns a object with keys name, category, os, version, vendor and os_version.

For unknown user-agent (or partially failed to parse), result objects may have value ‘UNKNOWN’.

  • category

    • labels of user terminal type, one of ‘pc’, ‘smartphone’, ‘mobilephone’, ‘appliance’, ‘crawler’ or ‘misc’ (or ‘UNKNOWN’)

  • name

    • the name of browser, like ‘Internet Explorer’, ‘Firefox’, ‘GoogleBot’

  • version

    • version string, like ‘8.0’ for IE, ‘9.0.1’ for Firefix, ‘0.2.149.27’ for Chrome, and so on

  • os

    • ex: ‘Windows 7’, ‘Mac OSX’, ‘iPhone’, ‘iPad’, ‘Android’

    • This field used to indicate cellar phone carrier for category ‘mobilephone’

  • vendor

    • optional field, shows browser vendor

  • os_version

    • optional field, shows version of operating systems

Finding crawlers (almost all, not all) in fast

woothee.is_crawler('Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 6.1; Trident/4.0)')
# => False

Try to see useragent’s category is ‘crawler’ or not, by casual(fast) method. Minor case of crawlers is not tested in this method. To check crawler strictly, use woothee.parse(str)['category'] == 'crawler'.

Authors

License

Copyright 2012- TAGOMORI Satoshi (tagomoris)

Licensed under the Apache License, Version 2.0 (the “License”); you may not use this file except in compliance with the License. You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an “AS IS” BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

History

1.10.1(Aug 8, 2019)

1.10.0(Apr 14, 2019)

1.8.0(Jul 5, 2018)

1.7.0(May 7, 2017)

  • #12 Release v1.7.0

  • Add support for WebView on Android.

  • Add support for curl.

  • Add support for trendictionbot crawler.

  • Add support for Yeti 1.1.

  • Compatible with Python 3.6.

  • Caution. We’ll drop Python2.6 and python3.2 support in the next version.

1.5.0(Aug 16, 2016)

1.4.0(May 17, 2016)

  • Add support for Vivaldi

1.3.0(Jan 7, 2016)

  • Add support for Firefox for iOS

1.2.0(Aug 16, 2015)

  • Add support for Twitterbot

  • Add support for webviews of mobile devices

  • Add support for Windows 10 and Edge browser

  • Add support for BlackBerry10

1.1.0(Mar 1, 2015)

1.0.0(Jan 20, 2015)

  • First release

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

woothee-1.10.1.tar.gz (17.5 kB view details)

Uploaded Source

Built Distribution

woothee-1.10.1-py2.py3-none-any.whl (17.0 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file woothee-1.10.1.tar.gz.

File metadata

  • Download URL: woothee-1.10.1.tar.gz
  • Upload date:
  • Size: 17.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.32.2 CPython/3.7.3

File hashes

Hashes for woothee-1.10.1.tar.gz
Algorithm Hash digest
SHA256 18f2129771a412890d4e32d6c50e0524b47dfe74ed2f2b7be6a6712a9086290b
MD5 38107b08bf58c62415f7b9b481040b33
BLAKE2b-256 fcfc0d3acc0cdb0dd7f5248d4c6adf0ac774571aa067ed8ba8299a8e3abbc897

See more details on using hashes here.

File details

Details for the file woothee-1.10.1-py2.py3-none-any.whl.

File metadata

  • Download URL: woothee-1.10.1-py2.py3-none-any.whl
  • Upload date:
  • Size: 17.0 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.32.2 CPython/3.7.3

File hashes

Hashes for woothee-1.10.1-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 33ed024f3c8cebb0400a1bf5a3c0c58a03b7ffab1b8f37e629a4bb662b2641b9
MD5 53b488d3f0caee9e0e94ecce55108ab8
BLAKE2b-256 adee1295c7d4bacd1acb292e62a6457374a3a65f1262a45b7e2c4f5934220535

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page