Culturomics
Culturomics is a form of computational lexicology that studies human behavior and cultural trends through the quantitative analysis of digitized texts.[1][2] Researchers data mine large digital archives to investigate cultural phenomena reflected in language and word usage.[3] The term is an American neologism first described in a 2010 Science article called Quantitative Analysis of Culture Using Millions of Digitized Books, co-authored by Harvard researchers Jean-Baptiste Michel and Erez Lieberman Aiden.[4]
Michel and Aiden helped create the Google Labs project Google Ngram Viewer which uses n-grams to analyze the Google Books digital library for cultural patterns in language use over time.
Studies
In a study called Culturomics 2.0, Kalev H. Leetaru examined news archives including print and broadcast media (television and radio transcripts) for words that imparted tone or "mood" as well as geographic data.[5][6] The research was able to retroactively predict the 2011 Arab Spring and successfully estimate the final location of Osama Bin Laden to within 124 miles.[5][6]
In a 2012 paper by Alexander M. Petersen and co-authors,[7] they found a "dramatic shift in the birth rate and death rates of words":[8] Deaths have increased and births have slowed. The authors also identified a universal "tipping point" in the life cycle of new words at about 30 to 50 years after their origin, they either enter the long-term lexicon or fall into disuse.[8]
In a 2014 paper by S. Roth, culturomic analyses is used to trace the decline of religion, the rise of politics, and the relevance of the economy to modern societies, with one of the major results being that modern societies do not appear to be capitalist or economized.[9] This paper is likely to be the first application of culturomics in sociology.
Criticism
Linguists and lexicographers have expressed skepticism regarding the methods and results of some of these studies, including one by Petersen et al.[10]
References
- ↑ Cohen, Patricia (16 December 2010). "In 500 Billion Words, New Window on Culture". New York Times.
- ↑ Hayes, Brian (May–June 2011). "Bit Lit". American Scientist 99 (3): 190. doi:10.1511/2011.90.190.
- ↑ Letcher, David W. (April 6, 2011). "Cultoromics: A New Way to See Temporal Changes in the Prevalence of Words and Phrases". American Institute of Higher Education 6th International Conference Proceedings 4 (1): 228.
- ↑ Michel, Jean-Baptiste; Liberman Aiden, Erez (16 December 2010). "Quantitative Analysis of Culture Using Millions of Digitized Books". Science 331 (6014): 176–82. doi:10.1126/science.1199644. PMC 3279742. PMID 21163965.
- ↑ 5.0 5.1 Leetaru, Kalev H. (5 September 2011). "Culturomics 2.0: Forecasting Large-Scale Human Behavior Using Global News Media Tone In Time And Space". First Monday 16 (9).
- ↑ 6.0 6.1 Quick, Darren (7 September 2011). "Culturomics research uses quarter-century of media coverage to forecast human behavior". Gizmag.com. Retrieved 9 September 2011.
- ↑ Petersen, Alexander M. (15 March 2012). "Statistical Laws Governing Fluctuations in Word Use from Word Birth to Word Death". Scientific Reports 2. doi:10.1038/srep00313.
- ↑ 8.0 8.1 "The New Science of the Birth and Death of Words ", CHRISTOPHER SHEA, Wall Street Journal, March 16, 2012
- ↑ Roth, S. (2014), "Fashionable functions. A Google ngram view of trends in functional differentiation (1800-2000)", International Journal of Technology and Human Interaction, Band 10, Nr. 2, S. 34-58 (online: http://ssrn.com/abstract=2491422).
- ↑ "When physicists do linguistics", BEN ZIMMER, Boston Globe, February 10, 2013
Further reading
- Michel, Jean-Baptiste; Liberman Aiden, Erez; Aiden, A. P.; Veres, A.; Gray, M. K.; Pickett, J. P.; Hoiberg, D.; Clancy, D.; Norvig, P.; Orwan, John; Nowak, Martin; Pinker, Steven (16 December 2010). "Quantitative Analysis of Culture Using Millions of Digitized Books". Science 331 (6014): 176–82. doi:10.1126/science.1199644. PMC 3279742. PMID 21163965.
- Leetaru, Kalev H. (5 September 2011). "Culturomics 2.0: Forecasting Large-Scale Human Behavior Using Global News Media Tone In Time And Space". First Monday 16 (9).
- Bohannon, John (14 January 2011). "Google Books, Wikipedia, and the Future of Culturomics". Science 331 (6014): 135. doi:10.1126/science.331.6014.135. PMID 21233356.
- Schwartz, Tim (1 April 2011). "Culturomics: Periodicals Gauge Culture's Pulse". Science 332 (6025): 35–36. doi:10.1126/science.332.6025.35-c.
- Morse-Gagné, Elise E. (1 April 2011). "Culturomics: Statistical Traps Muddy the Data". Science 332 (6025): 35. doi:10.1126/science.332.6025.35-b. PMID 21454771.
- Petersen, Alexander M.; Tenenbaum, Joel; Havlin, Shlomo; Stanley, H. Eugene (15 March 2012). "Statistical Laws Governing Fluctuations in Word Use from Word Birth to Word Death". Scientific Reports 2. doi:10.1038/srep00313.
- Petersen, Alexander M.; Tenenbaum, Joel; Havlin, Shlomo; Stanley, H. Eugene; Perc, Matjaz (10 December 2012). "Languages cool as they expand: Allometric scaling and the decreasing need for new words". Scientific Reports 2. doi:10.1038/srep00943.
- Shea, Christopher. "The New Science of the Birth and Death of Words". Wall Street Journal. Retrieved 15 January 2013.
- Acerbi, Alberto; Lampos, Vasileios; Garnett, Philip; Bentley, Alexander (20 March 2013). "The Expression of Emotions in 20th Century Books". PLoS ONE 8 (3): e59030. doi:10.1371/journal.pone.0059030. PMID 23527080.
- Bentley, Alexander; Acerbi, Alberto; Ormerod, Paul; Lampos, Vasileios (8 January 2014). "Books Average Previous Decade of Economic Misery". PLoS ONE 9 (1). doi:10.1371/journal.pone.0083147.
External links
- Culturomics.org, website by The Cultural Observatory at Harvard directed by Erez Lieberman Aiden and Jean-Baptiste Michel