Practicum: Clustering and Topic Modeling
In-Class Presentations: Student 9 and Student 10
Description: This unit focuses on modes of modeling textual content. We will learn how to build several types of models and discuss the math behind them. We will also investigate their use and misuse, and the impact of modeling on humanities scholarship.
- Willard McCarty, “Modeling: A Study in Words and Meanings.” A Companion to Digital Humanities, ed. Susan Schreibman, Ray Siemens, John Unsworth. Oxford: Blackwell, 2004. Chapter 19. [http://www.digitalhumanities.org/companion/].
- Benjamin M. Schmidt, “Words Alone: Dismantling Topic Models in the Humanities.” Journal of Digital Humanities, 2.1 (Winter 2012). [http://journalofdigitalhumanities.org/2-1/words-alone-by-benjamin-m-schmidt/].
- Ted Underwood, “Topic Modeling Made Just Simple Enough.” Personal blog. [http://tedunderwood.com/2012/04/07/topic-modeling-made-just-simple-enough/].
- Matthew Lee Jockers, Text Analysis with R for Students of Literature. Cham: Springer-Verlag, 2014. Chapters 11-13. [http://harvest.lib.ucdavis.edu/F/?func=direct&doc_number=003745646&local_base=UCD01PUB].
- Edwin Chen, “Introduction to Latent Dirichlet Allocation.” Personal blog. [http://blog.echen.me/2011/08/22/introduction-to-latent-dirichlet-allocation/].
- Mark Steyvers and Tom Griffiths, “Probabilistic Topic Models.” In T. Landauer, D. McNamara, S. Dennis, and W. Kintsch (eds.), Latent Semantic Analysis: A Road to Meaning. [http://psiexp.ss.uci.edu/research/papers/SteyversGriffithsLSABookFormatted.pdf].