Practicum: R Studio Tutorial In-Class Presentations: Student 1 and Student 2 Description: This discussion portion of this session will focus on introducing key concepts in computer programming and text processing. The practicum will be devoted to a tutorial on working with R using R Studio.
Practicum: Data Scraping and Cleaning In-Class Presentations: Student 3 and Student 4 Description: The focus of this unit is the process of corpus creation through data scraping and cleaning. We will learn how to scrape content from the web and to process it for future analysis. Discussion will focus on
Practicum: Tokenizing, lemmatization, frequency and correlation analysis In-Class Presentations: Student 5 and Student 6 Description: This session will focus on basic text processing techniques required as the basis for nearly all modes of textual analysis. Topics covered will include stemming, lemmatization, semantic reduction, naïve Bayesian classification, and word frequency analysis.
Practicum: Lexical Correlation and Lexical Variety In-Class Presentations: Student 7 and Student 8 Description: This unit focuses on basic modes of machine textual “reading” by analyzing of the words on the page and their relationships to each other. We will learn to perform various modes of machine reading and also
Practicum: Clustering and Topic Modeling In-Class Presentations: Student 9 and Student 10 Description: This unit focuses on modes of modeling textual content. We will learn how to build several types of models and discuss the math behind them. We will also investigate their use and misuse, and the impact of