Skip to main content

Cross-language UserAgent classifier library, python implementation

Project description

travis-ci.org coveralls.io downloads latest version license

The Python implementation of Project Woothee, which is multi-language user-agent strings parsers.

https://github.com/woothee/woothee

Installation

$ pip install woothee

Usage

Parsing user-agent

import woothee
woothee.parse("Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 6.1; Trident/4.0)")
# => {'name': 'Internet Explorer', 'category': 'pc', 'os': 'Windows 7', 'version': '8.0', 'vendor': 'Microsoft', 'os_version': 'NT 6.1'}

Parse user-agent string and returns a object with keys name, category, os, version, vendor and os_version.

For unknown user-agent (or partially failed to parse), result objects may have value ‘UNKNOWN’.

  • category

    • labels of user terminal type, one of ‘pc’, ‘smartphone’, ‘mobilephone’, ‘appliance’, ‘crawler’ or ‘misc’ (or ‘UNKNOWN’)

  • name

    • the name of browser, like ‘Internet Explorer’, ‘Firefox’, ‘GoogleBot’

  • version

    • version string, like ‘8.0’ for IE, ‘9.0.1’ for Firefix, ‘0.2.149.27’ for Chrome, and so on

  • os

    • ex: ‘Windows 7’, ‘Mac OSX’, ‘iPhone’, ‘iPad’, ‘Android’

    • This field used to indicate cellar phone carrier for category ‘mobilephone’

  • vendor

    • optional field, shows browser vendor

  • os_version

    • optional field, shows version of operating systems

Finding crawlers (almost all, not all) in fast

woothee.is_crawler('Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 6.1; Trident/4.0)')
# => False

Try to see useragent’s category is ‘crawler’ or not, by casual(fast) method. Minor case of crawlers is not tested in this method. To check crawler strictly, use woothee.parse(str)['category'] == 'crawler'.

Authors

License

Copyright 2012- TAGOMORI Satoshi (tagomoris)

Licensed under the Apache License, Version 2.0 (the “License”); you may not use this file except in compliance with the License. You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an “AS IS” BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

History

1.1.0(Mar 1, 2015)

1.0.0(Jan 20, 2015)

  • First release

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

woothee-1.1.0.tar.gz (13.2 kB view details)

Uploaded Source

Built Distribution

woothee-1.1.0-py2.py3-none-any.whl (15.0 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file woothee-1.1.0.tar.gz.

File metadata

  • Download URL: woothee-1.1.0.tar.gz
  • Upload date:
  • Size: 13.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for woothee-1.1.0.tar.gz
Algorithm Hash digest
SHA256 edd6a2b0246621993775e4a380f533b466565f2f278d0e7d5a2ddf6e0043549c
MD5 f23293835d67a312e7092c732a58381d
BLAKE2b-256 73ed0494e3f46e14cd2e4e103aa65d26400e24676426b02a680f89fb4f7e436f

See more details on using hashes here.

File details

Details for the file woothee-1.1.0-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for woothee-1.1.0-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 eff415f94922115535413e1f77e20c75a664ffdda3ee174bc5fc29bc7eb10eb4
MD5 3ff5e6ce4e96e3a390a216f3b8c0fc18
BLAKE2b-256 39fa890ea73de994dd5b25b83bf0ea087e228e1ee60cfb4bffc61cbddf446a7a

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page