96 million memes from the Memetracker. Memetracker tracks the quotes and phrases that appear most frequently over time across this entire online news spectrum. This makes it possible to see how different stories compete for news and blog coverage each day, and how certain stories persist while others fade quickly.
Overall Memetracker tracks more than 17 million different phrases and about 54% of the total phrase/quote mentions appear on blos and 46% in news media.
For each document (blog post or news media article):
Dataset statistics | |
---|---|
Number of documents | 96,608,034 |
Number of memes | 210,999,824 |
Number of links | 418,237,269 |
File | Description |
---|---|
quotes_2008-08.txt.gz | Memes and links from Aug 2008 |
quotes_2008-09.txt.gz | Memes and links from Sep 2008 |
quotes_2008-10.txt.gz | Memes and links from Oct 2008 |
quotes_2008-11.txt.gz | Memes and links from Nov 2008 |
quotes_2008-12.txt.gz | Memes and links from Dec 2008 |
quotes_2009-01.txt.gz | Memes and links from Jan 2009 |
quotes_2009-02.txt.gz | Memes and links from Feb 2009 |
quotes_2009-03.txt.gz | Memes and links from Mar 2009 |
quotes_2009-04.txt.gz | Memes and links from Apr 2009 |
where the first letter of the line encodes: