tidytext brings tidy data principles to text mining by converting text into a tidy data frame format. It provides tools for tokenization, sentiment analysis, n‑gram creation, and term‑document matrices, enabling interoperability with dplyr, ggplot2, and other tidyverse workflows.
Features
- Tokenizes text into tidy format (unnest_tokens)
- Supports sentiment lexicons (e.g. Bing, NRC) and TF-IDF computation
- Converts tm or quanteda objects into tidy data formats
- Easy integration with dplyr/ggplot2 for analysis and visualization
- Functions for n-grams, word co-occurrence, and document-term matrices
- Compatible with existing tidy data pipelines in R
Categories
Natural Language Processing (NLP)Follow tidytext
Other Useful Business Software
Your monitoring isn't a stack. It's a pile. Fix that.
Replace Datadog, New Relic, and Sentry without adding three more dashboards.
Rate This Project
Login To Rate This Project
User Reviews
Be the first to post a review of tidytext!