This R package facilitates replication of Twitter-based research by providing a convenient function to download lists of tweets.
The input for the package is a list of tweet ID numbers. See https://archive.org/details/gaza-tweets for an example.
The output of the package are the tweets downloaded as a tibble or as JSON files. Examples for both are below.
This package limits the rate of tweet downloading so Twitter’s 90,000 tweet/15 minute limit is not exceeded. If you choose to download the tweets to JSON files, then a new JSON file will be created for every 90,000 tweet ID numbers.
Tweets that have been deleted or made private cannot be downloaded.
Users must acquire a consumer key, consumer secret, access token, and access token secret from https://developer.twitter.com on their own.
Tibble Download Example
JSON File Download Example
kevincoakley, with zacharyst sending annoying e-mails.
Added a parameter called group_start that takes the list of split tweet IDs and keeps only those from group_start to the final list. That way, if a download is interrupted, which is likely for large corpuses, the user can restart the download at the group_start chunk, not from the beginning.
Added a line to print an estimate of how long a download will take
- Prepping for CRAN release
- Renamed Project to rehydratoR
- Save tweets to JSON files
- Changed rate limiting from waiting 915 seconds after every loop to timing the length of every loop and ensuring every loop is at least 915 seconds
- Rate limiting
- Added example file
- Initial Release
- Kevin Coakley - programming, testing
- Zachary Steinert-Threlkeld - requirements, testing, promoting
This project is licensed under the BSD License - see the LICENSE.md file for details
- Thank you to the support Christine Kirkpatrick and the San Diego Supercomputer Center at UCSD.