There will be no class meeting on October 13. Use the time to complete your text markup and to work on your in-class presentation.
Practicum: Unix/Linux Command Line and Git Basics Description: This predominantly practical session will be devoted to the basics of working with the Unix/Linux command line and the Git version control system.
Practicum: RStudio Tutorial In-Class Presentations: Student 1 and Student 2 Description: The discussion portion of this session will focus on introducing key concepts in computer programming and text processing. The practicum will be devoted to a tutorial on working with R using RStudio.
Practicum: Data Scraping and Cleaning In-Class Presentations: Student 3 and Student 4 Description: The focus of this unit is the process of corpus creation through data scraping and cleaning. We will learn how to scrape content from the web and to process it for future analysis. Discussion will focus on
Practicum: Tokenization, Lemmatization, Frequency and Correlation Analysis In-Class Presentations: Student 5 and Student 6 Description: This session will focus on the basic text processing techniques that underlie nearly all modes of textual analysis. Topics covered will include stemming, lemmatization, semantic reduction, naïve Bayesian classification, and word frequency analysis.