An exploratory look at 257,093 #JeSuisAhmed tweets

#JeSuisAhmed

Had some time last night to do some exploratory analysis on some of the #JeSuisAhmed collection. This analysis is from the first tweet I was able to harvest #JeSuisAhmed to some time on January 14, 2015 when I copied over the json to experiment with a few of the twarc utilities.

First tweet in data set:


Last tweet in data set:

Hydration

If you want to experiment/follow along with what I've done here, you can "rehydrate" the data set with twarc. You can grab the Tweet ids for #JeSuisAhmed from here (Data & Analysis tab).

% twarc.py --hydrate JeSuisAhmed-ids-20150113.txt > JeSuisAhmed-tweets-20150113.json

The hydration process will take some time. I'd highly suggest using GNU Screen or tmux, and grabbing a cup of coffee.

Map

#JeSuisAhmed tweets with geo coordinates.

In this data set, we have 2,329 tweets with geo coordinates availble. This represents about 0.91% of the entire data set (257,093 tweets).

How do you make this?

  • Create the geojson ~/git/twarc/utils/geojson.py JeSuisAhmed-tweets-dedupe-20150112.json > JeSuisAhmed-tweets-dedupe-20150112.geojson

  • Give the geojson a variable name.

  • Use Leaflet.js to put all the tweets with geo coordinates on a map like this.

Images

These are the image urls that have more than 1000 occurrences in the data set.

13703 Occurrences

img

10396 Occurrences

img

6088 Occurrences

img

4354 Occurrences

img

3229 Occurrences

img

3124 Occurrences

img

2307 Occurrences

img

2034 Occurrences

img

1949 Occurrences

img

1296 Occurrences

img

1182 Occurrences

img

1100 Occurrences

img

How do you get the image list (requires unshrtn)?

% ~/git/twarc/utils/image_urls.py JeSuisAhmed-tweets-unshortened-20150112.json > JeSuisAhmed-images-20150112.txt
% cat JeSuisAhmed-images-20150112.txt | sort | uniq -c | sort -rn > JeSuisAhmed-images-ranked-20150112.txt

The ranked url data set can be found here.

Retweets

What are the three most retweeted tweets in the hashtag?




How do you find out the most retweets tweets in the dataset? This will give you the top 10.

~/git/twarc/utils/retweets.py JeSuisAhmed-tweets-dedupe-20150112.json > JeSuisAhmed-retweets-20150112.json

Top URLs

Top 10 URLs tweeted from #JeSuisAhmed.

  1. http://www.huffingtonpost.ca/2015/01/08/ahmed-merabet-jesuisahmed-charlie-hebdo_n_6437984.html?ncid=tweetlnkushpmg00000067 (2895)
  2. http://limportant.fr/infos-jesuischarlie/76/360460 (1613)
  3. http://mic.com/articles/107988/the-hero-of-the-charlie-hebdo-shooting-we-re-overlooking (1318)
  4. http://www.huffingtonpost.co.uk/2015/01/08/charlie-hebdocharlie-hebdo-attack-jesuisahmed-hashtag-commemorating-ahmed-merabet-takes-off_n_6436528.html?1420731418&ncid=tweetlnkushpmg00000067 (919)
  5. http://www.huffingtonpost.com/2015/01/08/jesuisahmed-twitter-hashtag_n_6438132.html?ncid=tweetlnkushpmg00000067 (632)
  6. http://www.huffingtonpost.com/2015/01/08/jesuisahmed-twitter-hashtag_n_6438132.html?ncid=tweetlnkushpmg00000055 (592)
  7. http://www.dailymail.co.uk/news/article-2901681/Hero-police-officer-executed-street-married-42-year-old-Muslim-assigned-patrol-Paris-neighbourhood-Charlie-Hebdo-offices-located.html (571)
  8. http://blogs.mediapart.fr/blog/joel-villain/070115/il-sappelait-ahmed (555)
  9. http://www.bbc.co.uk/news/blogs-trending-30728491?ocid=socialflow_twitter (471)
  10. http://www.huffingtonpost.com/2015/01/08/jesuisahmed-twitter-hashtag_n_6438132.html?utm_hp_ref=tw (436)

Full list of urls can be found here.

How do you get the list (requires unshrtn)?

% cat JeSuisAhmed-tweets-20150112.json | ~/git/twarc/utils/unshorten.py > JeSuisAhmed-tweets-unshortened-20150112.json
% cat JeSuisAhmed-tweets-unshortened-20150112.json | ~/git/twarc/utils/urls.py| sort | uniq -c | sort -rn > JeSuisAhmed-urls.txt

Twitter Clients

Top 10 Twitter clients used from #JeSuisAhmed.

  1. Twitter for iPhone (85116)
  2. Twitter for Android (58819)
  3. Twitter Web Client (58166)
  4. Twitter for iPad (15304)
  5. Twitter for Websites (6877)
  6. Twitter for Windows Phone (5237)
  7. Twitter for Android Tablets (4420)
  8. TweetDeck (3790)
  9. Mobile Web (M5) (1708)
  10. Tweetbot for iΟS (1691)

Full list of clients can be found here.

How do you get the list of Twitter client sources?

% ~/git/twarc/utils/source.py JeSuisAhmed-tweets-20150112.json > JeSuisAhmed-sources-20150112.html