Table of Contents:
  • Introduction
  • Handling Textual Data
  • Regular Expressions
  • Basic Operations with Strings
  • Reading and Writing Files
  • Basic Corpus Statistics
  • Statistical Models
  • Geometrical Models
  • Dimensionality Reduction
  • Document Categorization
  • Document Search
  • Content Analysis.