Hadoop Twitter Analytics

So a while back, Twitter rolled out the option to download the entire archive of all the tweets. This got me thinking what if we wanted to analyze the tweets using Hadoop & its ecosystem to draw out interesting facts about the way we tweet, retweet etc.

Introduction

Hadoop Twitter Analytics aims to develop a solution for analyzing tweets using Hadoop & related technologies. This project currently contains fetching the Twitter IDs of those users whose statuses have been retweeted the most by the user whose tweets are being analyzed. Multiple analytics will continue to be added as I work on developing this project.

Fork me on GitHub

Fork me on GitHub

Continue reading