Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

LinkedIn Learning

Processing Text with R Essential Training

via LinkedIn Learning

Overview

Learn key techniques for cleansing and processing text in R, and discover how to convert text to a form that's ready for analytics and predictions.

Syllabus

Introduction
  • The emergence of text analytics
1. Introduction to Text Mining
  • Purpose
  • Document
  • Corpus
  • R text processing libraries
  • Setting up the environment
2. Corpus in R
  • PCorpus and VCorpus
  • Reading files with CorpusReader
  • Exploring the corpus
  • Persisting the corpus
3. Text Cleansing and Extraction
  • Setup for processing
  • Cleansing text
  • Stop word removal
  • Stemming
  • Managing metadata
4. TF-IDF
  • Introduction to tf-idf
  • Generating term frequency matrix
  • Improving term frequency matrix
  • Plotting term frequency
  • Generating tf-idf
5. N-Grams
  • N-grams concepts
  • Using RWeka NGramTokenizer
  • Creating an n-gram text frequency matrix
  • Extracting n-gram pairs
6. Best Practices
  • Storing text
  • Processing text data
  • Scalability
Conclusion
  • Next steps

Taught by

Kumaran Ponnambalam

Reviews

4.5 rating at LinkedIn Learning based on 31 ratings

Start your review of Processing Text with R Essential Training

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.