
A twarc plugin to extract hashtags from Twitter data

twarc-hashtags

This module extends twarc with a hashtags command that extracts and counts the hashtags in a tweet dataset.

Install

pip install twarc-hashtags

Collect some Twitter data, for example:

twarc2 search blacklivesmatter tweets.jsonl 

Because you installed the plugin, twarc2 now has a new hashtags subcommand:

twarc2 hashtags tweets.jsonl hashtags.csv

Then open hashtags.csv in your favourite spreadsheet program or DataFrame library.
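
If you would rather stay in plain Python, the file can be read with the standard csv module. This is a minimal sketch: load_rows is a hypothetical helper name, and it deliberately makes no assumption about the column names in hashtags.csv, taking them from the header row of the file instead.

```python
import csv
from typing import Dict, Iterable, List


def load_rows(lines: Iterable[str]) -> List[Dict[str, str]]:
    """Parse CSV text into a list of dicts keyed by the header row.

    `lines` can be an open file or any other iterable of text lines.
    """
    return list(csv.DictReader(lines))


# Usage, once twarc2 has written the file:
#
#     with open("hashtags.csv", newline="", encoding="utf-8") as f:
#         rows = load_rows(f)
#     print(rows[:10])
```

Every value comes back as a string, so convert any count column to int before sorting or summing.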

Behind the scenes twarc-hashtags uses Python's native support for SQLite to create a database, insert into it, and then query it. After the program finishes, you can find this database as hashtags.db in your current working directory.
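
To look inside that database from Python, something like the following should work. The table layout is not described here, so this sketch only lists whatever tables exist by querying SQLite's built-in sqlite_master catalog; list_tables is a hypothetical helper name.

```python
import sqlite3
from typing import List


def list_tables(con: sqlite3.Connection) -> List[str]:
    """Return the names of all tables in an open SQLite database."""
    rows = con.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
    ).fetchall()
    return [name for (name,) in rows]


# Usage, after twarc2 hashtags has run in the current directory:
#
#     con = sqlite3.connect("hashtags.db")
#     print(list_tables(con))
#     con.close()
```

Note that sqlite3.connect creates an empty file if hashtags.db does not exist yet, so an empty list usually means you are in the wrong directory.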

Options

--group: group results by day, week, month, year

--limit: limit to this number of hashtags (per group if --group is used)

--db: name the database something other than hashtags.db

--no-insert: use an existing database instead of inserting (useful for large numbers of tweets)
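
To make the meaning of --group and --limit concrete, here is a rough sketch of the same kind of counting done with sqlite3. It is not the plugin's actual code: the table schema, the count_hashtags function, the GROUP_FORMATS period labels, and the tie-breaking order are all assumptions made for illustration. It does assume Twitter API v2 tweet objects, where hashtags appear under entities.hashtags with a tag field.

```python
import sqlite3
from typing import Iterable, List, Optional, Tuple

# Hypothetical mapping from a --group value to a SQLite strftime pattern.
GROUP_FORMATS = {"day": "%Y-%m-%d", "week": "%Y-%W", "month": "%Y-%m", "year": "%Y"}


def count_hashtags(
    tweets: Iterable[dict],
    group: Optional[str] = None,
    limit: Optional[int] = None,
) -> List[Tuple[Optional[str], str, int]]:
    """Count hashtags, optionally per time period, returning
    (period, hashtag, count) tuples. period is None when ungrouped."""
    con = sqlite3.connect(":memory:")
    # Hypothetical schema: one row per hashtag occurrence.
    con.execute("CREATE TABLE hashtags (hashtag TEXT, created TEXT)")
    rows = [
        (h["tag"], t["created_at"])
        for t in tweets
        for h in t.get("entities", {}).get("hashtags", [])
    ]
    con.executemany("INSERT INTO hashtags VALUES (?, ?)", rows)

    if group is None:
        cur = con.execute(
            "SELECT NULL AS period, hashtag, COUNT(*) AS n FROM hashtags "
            "GROUP BY hashtag ORDER BY n DESC, hashtag"
        )
    else:
        cur = con.execute(
            "SELECT strftime(?, created) AS period, hashtag, COUNT(*) AS n "
            "FROM hashtags GROUP BY period, hashtag "
            "ORDER BY period, n DESC, hashtag",
            (GROUP_FORMATS[group],),
        )

    # Apply the limit per period (or overall when there is no grouping).
    results = []
    kept = {}
    for period, tag, n in cur:
        kept[period] = kept.get(period, 0) + 1
        if limit is None or kept[period] <= limit:
            results.append((period, tag, n))
    con.close()
    return results
```

Calling count_hashtags(tweets, group="day", limit=10) would then give at most ten hashtags for each day in the dataset, which mirrors how --limit is described as applying per group when --group is used.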
