Fast and reliable User Agent parser and IP classifier for Python
Udger client for Python (data ver. 3)
=====================================
The local parser is a very fast and accurate user agent string detection solution. It lets developers install and integrate a highly scalable product locally.
It detects the device type (personal computer, tablet, smart TV, game console, etc.), the operating system, and the client software type (browser, e-mail client, etc.).
It also provides information about IP addresses (public proxies, VPN services, Tor exit nodes, fake crawlers, web scrapers, etc.).
- Tested with more than 50,000 unique user agents.
- Up-to-date data provided by https://udger.com/
- Support for Python 3
Enjoy!
Install using pip
-----------------
$ pip install udger
Install from git repo
---------------------
$ git clone https://github.com/udger/udger-python
$ cd udger-python/
# python setup.py install
Automatic updates download
--------------------------
To update the data automatically, please use the Udger data updater (https://udger.com/support/documentation/?doc=62).
Help us
-------
Feel free to send us a Pull Request on GitHub to help us make Udger for Python better.
Or just let us know of any issues you encounter.
Thank you!
Usage
-----
$ python
>>> from pprint import pprint
>>> from udger import Udger
>>> udger = Udger()
>>>
>>> result = udger.parse_ua(
... 'Mozilla/5.0 (iPad; CPU OS 7_0 like Mac OS X) AppleWebKit/537.51.1 (KHTML, like Gecko) Version/7.0 Mobile/11A465 Safari/9537.53'
... )
>>> pprint(result)
{'crawler_category': None,
'crawler_category_code': None,
'crawler_last_seen': None,
'crawler_respect_robotstxt': None,
'device_brand': 'Apple',
'device_brand_code': 'apple',
'device_brand_homepage': 'http://www.apple.com/',
'device_brand_icon': 'apple.png',
'device_brand_icon_big': 'apple_big.png',
'device_brand_info_url': 'https://udger.com/resources/ua-list/devices-brand-detail?brand=apple',
'device_class': 'Tablet',
'device_class_code': 'tablet',
'device_class_icon': 'tablet.png',
'device_class_icon_big': 'tablet_big.png',
'device_class_info_url': 'https://udger.com/resources/ua-list/device-detail?device=Tablet',
'device_marketname': 'iPad',
'os': 'iOS 7',
'os_code': 'ios_7',
'os_family': 'iOS',
'os_family_code': 'ios',
'os_family_vendor': 'Apple Inc.',
'os_family_vendor_code': 'apple_inc',
'os_family_vendor_homepage': 'http://www.apple.com/',
'os_homepage': 'https://en.wikipedia.org/wiki/IOS_7',
'os_icon': 'iphone.png',
'os_icon_big': 'iphone_big.png',
'os_info_url': 'https://udger.com/resources/ua-list/os-detail?os=iOS%207',
'ua': 'Safari mobile 7.0',
'ua_class': 'Mobile browser',
'ua_class_code': 'mobile_browser',
'ua_engine': 'WebKit',
'ua_family': 'Safari mobile',
'ua_family_code': 'safari_mobile',
'ua_family_homepage': 'https://en.wikipedia.org/wiki/Safari_%28web_browser%29',
'ua_family_icon': 'safari.png',
'ua_family_icon_big': 'safari_big.png',
'ua_family_info_url': 'https://udger.com/resources/ua-list/browser-detail?browser=Safari%20mobile',
'ua_family_vendor': 'Apple Inc.',
'ua_family_vendor_code': 'apple_inc',
'ua_family_vendor_homepage': 'http://www.apple.com/',
'ua_string': 'Mozilla/5.0 (iPad; CPU OS 7_0 like Mac OS X) '
'AppleWebKit/537.51.1 (KHTML, like Gecko) Version/7.0 '
'Mobile/11A465 Safari/9537.53',
'ua_uptodate_current_version': '',
'ua_version': '7.0',
'ua_version_major': '7'}
>>>
>>> result = udger.parse_ua('Some thing')
>>> pprint(result)
{'crawler_category': None,
'crawler_category_code': None,
'crawler_last_seen': None,
'crawler_respect_robotstxt': None,
'device_brand': None,
'device_brand_code': None,
'device_brand_homepage': None,
'device_brand_icon': None,
'device_brand_icon_big': None,
'device_brand_info_url': None,
'device_class': None,
'device_class_code': None,
'device_class_icon': None,
'device_class_icon_big': None,
'device_class_info_url': None,
'device_marketname': None,
'os': None,
'os_code': None,
'os_family': None,
'os_family_code': None,
'os_family_vendor': None,
'os_family_vendor_code': None,
'os_family_vendor_homepage': None,
'os_homepage': None,
'os_icon': None,
'os_icon_big': None,
'os_info_url': None,
'ua': None,
'ua_class': 'Unrecognized',
'ua_class_code': 'unrecognized',
'ua_engine': None,
'ua_family': None,
'ua_family_code': None,
'ua_family_homepage': None,
'ua_family_icon': None,
'ua_family_icon_big': None,
'ua_family_info_url': None,
'ua_family_vendor': None,
'ua_family_vendor_code': None,
'ua_family_vendor_homepage': None,
'ua_string': 'Some thing',
'ua_uptodate_current_version': None,
'ua_version': None,
'ua_version_major': None}
>>>
>>> result = udger.parse_ip('69.89.31.120')
>>> pprint(result)
{'crawler_category': None,
'crawler_category_code': None,
'crawler_family': None,
'crawler_family_code': None,
'crawler_family_homepage': None,
'crawler_family_icon': None,
'crawler_family_info_url': None,
'crawler_family_vendor': None,
'crawler_family_vendor_code': None,
'crawler_family_vendor_homepage': None,
'crawler_last_seen': None,
'crawler_name': None,
'crawler_respect_robotstxt': None,
'crawler_ver': None,
'crawler_ver_major': None,
'datacenter_homepage': 'https://www.bluehost.com/',
'datacenter_name': 'Bluehost Inc.',
'datacenter_name_code': 'bluehost',
'ip': '69.89.31.120',
'ip_city': 'Provo',
'ip_classification': 'Web scraper',
'ip_classification_code': 'web_scraper',
'ip_country': 'United States',
'ip_country_code': 'US',
'ip_hostname': 'box320.bluehost.com',
'ip_last_seen': '2016-09-17 12:13:25',
'ip_ver': 4}
>>>
>>> result = udger.parse_ip('108.61.199.93')
>>> pprint(result)
{'crawler_category': 'Site monitor',
'crawler_category_code': 'site_monitor',
'crawler_family': 'PINGOMETER',
'crawler_family_code': 'pingometer',
'crawler_family_homepage': '',
'crawler_family_icon': 'bot_pingometer.png',
'crawler_family_info_url': 'https://udger.com/resources/ua-list/bot-detail?bot=PINGOMETER',
'crawler_family_vendor': 'Pingometer, LLC',
'crawler_family_vendor_code': 'pingometer_llc',
'crawler_family_vendor_homepage': 'http://pingometer.com/',
'crawler_last_seen': '2016-09-17 12:15:38',
'crawler_name': 'PINGOMETER',
'crawler_respect_robotstxt': 'no',
'crawler_ver': '',
'crawler_ver_major': '',
'datacenter_homepage': 'https://www.choopa.com/',
'datacenter_name': 'Choopa, LLC.',
'datacenter_name_code': 'choopa',
'ip': '108.61.199.93',
'ip_city': 'Amsterdam',
'ip_classification': 'Crawler',
'ip_classification_code': 'crawler',
'ip_country': 'Netherlands',
'ip_country_code': 'NL',
'ip_hostname': '108.61.199.93.vultr.com',
'ip_last_seen': '2016-09-17 12:00:31',
'ip_ver': 4}
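The result dictionaries shown above can be post-processed with plain Python. The following is a minimal sketch and not part of the Udger API: the helper names ``summarize_ua`` and ``is_suspicious_ip`` and the set of flagged classification codes are assumptions made for illustration. The sample dictionaries are trimmed copies of the output above, so the sketch runs without the data file.

```python
# Hypothetical helpers (not part of udger) that work on the plain dicts
# returned by parse_ua() and parse_ip().

# Assumption: which classification codes count as suspicious is an
# application decision; these two codes appear in the sample output above.
SUSPICIOUS_IP_CODES = {"web_scraper", "crawler"}


def summarize_ua(result):
    """Return a one-line summary of a parse_ua() result dict."""
    if result.get("ua_class_code") == "unrecognized":
        return "Unrecognized: " + result["ua_string"]
    parts = [result.get("ua"), result.get("os"), result.get("device_class")]
    # Skip fields the parser left as None or as an empty string.
    return " / ".join(p for p in parts if p)


def is_suspicious_ip(result):
    """Return True if a parse_ip() result carries a flagged classification."""
    return result.get("ip_classification_code") in SUSPICIOUS_IP_CODES


# Trimmed copies of the results shown above.
ipad = {"ua": "Safari mobile 7.0", "os": "iOS 7", "device_class": "Tablet",
        "ua_class_code": "mobile_browser"}
unknown = {"ua": None, "ua_class_code": "unrecognized",
           "ua_string": "Some thing"}
scraper = {"ip": "69.89.31.120", "ip_classification_code": "web_scraper"}

print(summarize_ua(ipad))
print(summarize_ua(unknown))
print(is_suspicious_ip(scraper))
```

Because unrecognized fields come back as ``None`` (or occasionally as an empty string), the helpers use ``dict.get`` and filter out falsy values instead of assuming every key holds text.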
Data directory
--------------
The ``Udger()`` parser expects the data file to be in the system temporary
directory, as returned by ``tempfile.gettempdir()``.
You may override the path by passing it as an argument, like this:
udger = Udger('/var/cache/udger/')
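Before constructing the parser, it can be useful to check that the data file is actually present in the chosen directory. A minimal sketch follows. The helper names are hypothetical, and the file name ``udgerdb_v3.dat`` is an assumption about the data file distributed by Udger, so adjust it to match the file you downloaded.

```python
import os
import tempfile

# Assumption: name of the downloaded data file; adjust it to your file.
DATA_FILENAME = "udgerdb_v3.dat"


def resolve_data_dir(data_dir=None):
    """Return data_dir, or the system temporary directory if none is given."""
    return data_dir if data_dir is not None else tempfile.gettempdir()


def data_file_exists(data_dir=None, filename=DATA_FILENAME):
    """Return True if the data file is present in the resolved directory."""
    return os.path.isfile(os.path.join(resolve_data_dir(data_dir), filename))


print(resolve_data_dir())                     # default location
print(resolve_data_dir("/var/cache/udger/"))  # explicit override
print(data_file_exists("/var/cache/udger/"))
```

Checking up front gives a clearer error than a failure inside the parser, and it makes the default location explicit when the temporary directory differs between environments.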
Forked from
-----------
Based on the code by Jure Ham (jure.ham@zemanta.com),
https://github.com/hamaxx/uasparser2
Previously, a Python version of https://github.com/kaittodesk/uasparser2
by Hicro Kee (http://hicrokee.com) email: hicrokee AT gmail DOT com
and modified by Michal Molhanec http://molhanec.net
Documentation for developers
----------------------------
https://udger.com/pub/documentation/parser/Python/html/
Author
------
The Udger.com Team (info@udger.com)
Old v1 format
-------------
If you still use the previous format of the database (v1), please see the branch ``old_format_v1``.