Corrected bullet points on frequency distribution and bag of words slides.
Added layout for corpus analysis. Added slides that clarify bag of words scoring and slides that describe Word2Vec and GloVe. Normalized styling of slides and added references for word embeddings.
Added worked solutions to stemmer in-class problems. Clarified lemmatization description slides. Added 2 in-class problems for lemmatization.
Added slide comparing the Lancaster and Porter stemmers. Capitalized Lancaster and Porter. Consolidated code to create a side-by-side comparison of the stemmers.
Combined basic text analysis and text classification slides. Added slides for bag of words and word embedding. Created Python file for corpus analysis examples. Clarified POS tagger.
Added slides and code for lemmatization. Added references. Clarified stemming slides, and added stemming example code.
Added details to stemmer slides and part-of-speech slides. Clarified class exercise slide for tokenization. Moved the separated stemming slide back into the stemming section.
Added code for stemmers. Added code to generate part-of-speech acronyms and their meanings. Cleaned up variable names and comments.