Facebook Twitter Gplus RSS
magnify
Home Digital Methods November 10: Basic approaches to textual analysis
formats

November 10: Basic approaches to textual analysis

Practicum: Tokenizing, lemmatization, frequency and correlation analysis
In-Class Presentations: Student 5 and Student 6

Description: This session will focus on basic text processing techniques required as the basis for nearly all modes of textual analysis. Topics covered will include stemming, lemmatization, semantic reduction, naïve Bayesian classification, and word frequency analysis. We will discuss the hardware and software based biases that computers bring to these tasks and how these affect, direct, expand, and/or limit how human scholars engage with text as data.


Required Reading:

Source Reading:

 
 Share on Facebook Share on Twitter Share on Reddit Share on LinkedIn
No Comments  comments 
© 2015 by Carl G Stahmer
All Rights Reserved