Monday, April 4, 2016 at 5:45 PM @felixsalmon @gabrielsnyder Well... # of documents (11.5MM), total corpus size in words, # of proper names/nouns, # of links