Fast and reliable User Agent parser and IP classifier for Python

Udger client for Python (data ver. 3)
=====================================

The local parser is a fast and accurate user agent string detection solution that developers can install and integrate locally, and it is designed to scale.
It detects devices (personal computer, tablet, smart TV, game console, etc.), the operating system, and the client software type (browser, e-mail client, etc.).
It also provides information about IP addresses (public proxies, VPN services, Tor exit nodes, fake crawlers, web scrapers, etc.).

- Tested with more than 50,000 unique user agents.
- Up-to-date data provided by https://udger.com/
- Support for Python 3

Enjoy!

Install using pip
-----------------

$ pip install udger

Install from git repo
---------------------

$ git clone https://github.com/udger/udger-python
$ cd udger-python/
# python setup.py install

Automatic updates download
--------------------------

To update the data automatically, use the Udger data updater (https://udger.com/support/documentation/?doc=62).

Help us
-------

Feel free to send us a Pull Request on GitHub to help us make Udger for Python better.
Or just let us know of any issues you encounter.

Thank you!

Usage
-----

$ python
>>> from pprint import pprint
>>> from udger import Udger
>>> udger = Udger()
>>>
>>> result = udger.parse_ua(
... 'Mozilla/5.0 (iPad; CPU OS 7_0 like Mac OS X) AppleWebKit/537.51.1 (KHTML, like Gecko) Version/7.0 Mobile/11A465 Safari/9537.53'
... )
>>> pprint(result)
{'crawler_category': None,
'crawler_category_code': None,
'crawler_last_seen': None,
'crawler_respect_robotstxt': None,
'device_brand': 'Apple',
'device_brand_code': 'apple',
'device_brand_homepage': 'http://www.apple.com/',
'device_brand_icon': 'apple.png',
'device_brand_icon_big': 'apple_big.png',
'device_brand_info_url': 'https://udger.com/resources/ua-list/devices-brand-detail?brand=apple',
'device_class': 'Tablet',
'device_class_code': 'tablet',
'device_class_icon': 'tablet.png',
'device_class_icon_big': 'tablet_big.png',
'device_class_info_url': 'https://udger.com/resources/ua-list/device-detail?device=Tablet',
'device_marketname': 'iPad',
'os': 'iOS 7',
'os_code': 'ios_7',
'os_family': 'iOS',
'os_family_code': 'ios',
'os_family_vendor': 'Apple Inc.',
'os_family_vendor_code': 'apple_inc',
'os_family_vendor_homepage': 'http://www.apple.com/',
'os_homepage': 'https://en.wikipedia.org/wiki/IOS_7',
'os_icon': 'iphone.png',
'os_icon_big': 'iphone_big.png',
'os_info_url': 'https://udger.com/resources/ua-list/os-detail?os=iOS%207',
'ua': 'Safari mobile 7.0',
'ua_class': 'Mobile browser',
'ua_class_code': 'mobile_browser',
'ua_engine': 'WebKit',
'ua_family': 'Safari mobile',
'ua_family_code': 'safari_mobile',
'ua_family_homepage': 'https://en.wikipedia.org/wiki/Safari_%28web_browser%29',
'ua_family_icon': 'safari.png',
'ua_family_icon_big': 'safari_big.png',
'ua_family_info_url': 'https://udger.com/resources/ua-list/browser-detail?browser=Safari%20mobile',
'ua_family_vendor': 'Apple Inc.',
'ua_family_vendor_code': 'apple_inc',
'ua_family_vendor_homepage': 'http://www.apple.com/',
'ua_string': 'Mozilla/5.0 (iPad; CPU OS 7_0 like Mac OS X) '
'AppleWebKit/537.51.1 (KHTML, like Gecko) Version/7.0 '
'Mobile/11A465 Safari/9537.53',
'ua_uptodate_current_version': '',
'ua_version': '7.0',
'ua_version_major': '7'}
>>>
>>> result = udger.parse_ua('Some thing')
>>> pprint(result)
{'crawler_category': None,
'crawler_category_code': None,
'crawler_last_seen': None,
'crawler_respect_robotstxt': None,
'device_brand': None,
'device_brand_code': None,
'device_brand_homepage': None,
'device_brand_icon': None,
'device_brand_icon_big': None,
'device_brand_info_url': None,
'device_class': None,
'device_class_code': None,
'device_class_icon': None,
'device_class_icon_big': None,
'device_class_info_url': None,
'device_marketname': None,
'os': None,
'os_code': None,
'os_family': None,
'os_family_code': None,
'os_family_vendor': None,
'os_family_vendor_code': None,
'os_family_vendor_homepage': None,
'os_homepage': None,
'os_icon': None,
'os_icon_big': None,
'os_info_url': None,
'ua': None,
'ua_class': 'Unrecognized',
'ua_class_code': 'unrecognized',
'ua_engine': None,
'ua_family': None,
'ua_family_code': None,
'ua_family_homepage': None,
'ua_family_icon': None,
'ua_family_icon_big': None,
'ua_family_info_url': None,
'ua_family_vendor': None,
'ua_family_vendor_code': None,
'ua_family_vendor_homepage': None,
'ua_string': 'Some thing',
'ua_uptodate_current_version': None,
'ua_version': None,
'ua_version_major': None}
>>>
>>> result = udger.parse_ip('69.89.31.120')
>>> pprint(result)
{'crawler_category': None,
'crawler_category_code': None,
'crawler_family': None,
'crawler_family_code': None,
'crawler_family_homepage': None,
'crawler_family_icon': None,
'crawler_family_info_url': None,
'crawler_family_vendor': None,
'crawler_family_vendor_code': None,
'crawler_family_vendor_homepage': None,
'crawler_last_seen': None,
'crawler_name': None,
'crawler_respect_robotstxt': None,
'crawler_ver': None,
'crawler_ver_major': None,
'datacenter_homepage': 'https://www.bluehost.com/',
'datacenter_name': 'Bluehost Inc.',
'datacenter_name_code': 'bluehost',
'ip': '69.89.31.120',
'ip_city': 'Provo',
'ip_classification': 'Web scraper',
'ip_classification_code': 'web_scraper',
'ip_country': 'United States',
'ip_country_code': 'US',
'ip_hostname': 'box320.bluehost.com',
'ip_last_seen': '2016-09-17 12:13:25',
'ip_ver': 4}
>>>
>>> result = udger.parse_ip('108.61.199.93')
>>> pprint(result)
{'crawler_category': 'Site monitor',
'crawler_category_code': 'site_monitor',
'crawler_family': 'PINGOMETER',
'crawler_family_code': 'pingometer',
'crawler_family_homepage': '',
'crawler_family_icon': 'bot_pingometer.png',
'crawler_family_info_url': 'https://udger.com/resources/ua-list/bot-detail?bot=PINGOMETER',
'crawler_family_vendor': 'Pingometer, LLC',
'crawler_family_vendor_code': 'pingometer_llc',
'crawler_family_vendor_homepage': 'http://pingometer.com/',
'crawler_last_seen': '2016-09-17 12:15:38',
'crawler_name': 'PINGOMETER',
'crawler_respect_robotstxt': 'no',
'crawler_ver': '',
'crawler_ver_major': '',
'datacenter_homepage': 'https://www.choopa.com/',
'datacenter_name': 'Choopa, LLC.',
'datacenter_name_code': 'choopa',
'ip': '108.61.199.93',
'ip_city': 'Amsterdam',
'ip_classification': 'Crawler',
'ip_classification_code': 'crawler',
'ip_country': 'Netherlands',
'ip_country_code': 'NL',
'ip_hostname': '108.61.199.93.vultr.com',
'ip_last_seen': '2016-09-17 12:00:31',
'ip_ver': 4}
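
The result dictionaries above can be post-processed with ordinary Python. The following is a minimal sketch, not part of the udger package: ``summarize_ua``, ``is_flagged_ip`` and ``FLAGGED_IP_CODES`` are hypothetical names, and the set of flagged codes contains only the two classification codes that appear in the session above. The sample dictionaries are trimmed copies of those results, so the snippet runs without a data file.

```python
# Sketch only: these helpers work on plain dicts shaped like the results
# printed above; they are illustrative and not part of the udger package.

# Classification codes seen in the session above. Treat this set as an
# assumption and adjust it to your own policy.
FLAGGED_IP_CODES = frozenset({"web_scraper", "crawler"})


def summarize_ua(result):
    """Return a short label such as 'Safari mobile 7.0 on iOS 7 (Tablet)'."""
    if result.get("ua_class_code") == "unrecognized":
        return "unrecognized: %r" % result.get("ua_string")
    parts = [result.get("ua") or "unknown client"]
    if result.get("os"):
        parts.append("on %s" % result["os"])
    if result.get("device_class"):
        parts.append("(%s)" % result["device_class"])
    return " ".join(parts)


def is_flagged_ip(result, flagged_codes=FLAGGED_IP_CODES):
    """Return True when the IP classification code is in the flagged set."""
    return result.get("ip_classification_code") in flagged_codes


# Trimmed copies of the results shown in the session above.
ipad = {"ua": "Safari mobile 7.0", "ua_class_code": "mobile_browser",
        "os": "iOS 7", "device_class": "Tablet"}
unknown = {"ua": None, "ua_class_code": "unrecognized",
           "ua_string": "Some thing"}
scraper_ip = {"ip": "69.89.31.120", "ip_classification_code": "web_scraper"}

print(summarize_ua(ipad))         # Safari mobile 7.0 on iOS 7 (Tablet)
print(summarize_ua(unknown))      # unrecognized: 'Some thing'
print(is_flagged_ip(scraper_ip))  # True
```

In real use, the dictionaries would come from ``udger.parse_ua(...)`` and ``udger.parse_ip(...)`` as shown in the session. Checking ``ua_class_code`` first matters because every other field is ``None`` for an unrecognized string.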

Data directory
--------------

By default, the ``Udger()`` parser expects the data file to be in the system temporary
directory, as returned by ``tempfile.gettempdir()``.

You can override the path by passing the data directory as an argument:

udger = Udger('/var/cache/udger/')
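
Before constructing the parser, it can help to check that the data file is actually where it will be looked for. The following is a minimal sketch of that check using only the standard library. ``resolve_data_dir`` and ``data_file_status`` are hypothetical helpers, and the file name ``udgerdb_v3.dat`` is an assumption here; use the name of the file you downloaded.

```python
import os
import tempfile

# Assumption: the data file name below is illustrative. Check the name of
# the file you actually downloaded from udger.com.
DATA_FILE_NAME = "udgerdb_v3.dat"


def resolve_data_dir(data_dir=None):
    """Return the explicit directory, or the system temp dir by default."""
    return data_dir if data_dir else tempfile.gettempdir()


def data_file_status(data_dir=None, file_name=DATA_FILE_NAME):
    """Return (path, exists) for the expected data file."""
    path = os.path.join(resolve_data_dir(data_dir), file_name)
    return path, os.path.isfile(path)


# Report whether the data file is present in the overridden directory.
path, exists = data_file_status("/var/cache/udger/")
print(path, "found" if exists else "missing")
```

Calling ``data_file_status()`` with no arguments checks the default location instead, which mirrors the fallback to ``tempfile.gettempdir()`` described above.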


Forked from
-----------

Based on the code by Jure Ham (jure.ham@zemanta.com),
https://github.com/hamaxx/uasparser2

Previously, a Python version of https://github.com/kaittodesk/uasparser2
by Hicro Kee (http://hicrokee.com) email: hicrokee AT gmail DOT com
and modified by Michal Molhanec http://molhanec.net

Documentation for developers
----------------------------

https://udger.com/pub/documentation/parser/Python/html/

Author
------

The Udger.com Team (info@udger.com)

Old v1 format
-------------

If you still use the previous format of the database (v1), please see the ``old_format_v1`` branch.
