

Class 7 Notes

Page history last edited by Alan Liu 9 years, 2 months ago

Preliminary Class Business



1. Thinking Ahead to the Class Project



  • Transcriptions Research Slam, "SynchDH" (May 8th)
  • Your initial ideas about a corpus to study?


2. Topic Modeling


Readings for Class 6:

Other resources:


  • Building blocks of text analysis:
    1. Counting (frequency)
    2. Co-occurrence (collocation)
    3. Clustering
    4. Comparison with a reference corpus (as "corpus" is understood in the field of corpus linguistics)
    5. Other important supporting or complementary methods of text analysis:
      • Part-of-speech (POS) analysis
      • Named entity recognition (NER)
      • Sentiment analysis
    6. Visualization
  • Current leading-edge methods of text analysis that build on the lower-level "building block" methods above:
    • topic modeling
    • social network analysis
  • The idea of topic modeling
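
The first, second, and fourth building blocks above (counting, co-occurrence, and comparison with a reference corpus) can be illustrated with a minimal Python sketch. This is only an assumption-laden toy, not a method from the readings: the function names, the two sample sentences, the window size, and the add-one smoothing are all hypothetical choices made for illustration.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def word_frequencies(tokens):
    """Counting: how many times each word occurs."""
    return Counter(tokens)


def cooccurrences(tokens, window=2):
    """Co-occurrence: count unordered word pairs that appear
    within `window` tokens of each other."""
    pairs = Counter()
    for i, left in enumerate(tokens):
        for right in tokens[i + 1 : i + 1 + window]:
            if left != right:
                pairs[tuple(sorted((left, right)))] += 1
    return pairs


def frequency_ratio(word, target_tokens, reference_tokens):
    """Comparison: ratio of a word's relative frequency in the target
    text to its relative frequency in a reference text.
    Add-one smoothing (an illustrative choice) avoids division by zero."""
    target_rate = (target_tokens.count(word) + 1) / (len(target_tokens) + 1)
    reference_rate = (reference_tokens.count(word) + 1) / (len(reference_tokens) + 1)
    return target_rate / reference_rate


# Hypothetical sample texts, invented for this sketch.
target = tokenize("The whale swam and the whale dived.")
reference = tokenize("The cat sat on the mat.")

print(word_frequencies(target).most_common(3))
print(cooccurrences(target).most_common(3))
print(frequency_ratio("whale", target, reference))
```

A ratio above 1 suggests the word is more characteristic of the target text than of the reference text. Real corpus-linguistics tools use more careful statistics (and far larger reference corpora) than this toy ratio.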


3. Your Practicums




