Twitter Data Collection tools released

Aug 5, 2012 · 1 min read

I have released the Java programs that I’ve been using to collect data from Twitter for my PhD since November 2009.

It is suite of programs that use the Twitter Search API and Twitter Stream API to get tweets and then store them in a mySQL database. They have been tested on Mac OS X and Debian Linux.

The Java source is split into 9 Eclipse projects using Maven to bring in the external library dependencies.

For people who just want to use the tools, I’ve included the runnable Java files for the main modules and the shell scripts and a suggested cron file to run them in the Example directory. The README.MD file in the Example directory has instructions on how to create the mySQL databases and configuration files.

The tStreamingArchiver project code is available on GitHub: https://github.com/brendam/tStreamingArchiver.

Please tell me if you find it useful.

Dr Brenda Moon
Authors
Retired
Managing bushland property in the Victorian central goldfields region. Retired social media researcher - my primary research interest was using interdisciplinary approaches to apply and develop digital methods.