data.table is an R package that extends base R’s data.frame for high-performance data manipulation. It offers a concise syntax, fast execution, and memory-efficient operations, and it supports fast file reading and writing, joins, grouping, reshaping, and updates by reference. It is widely used in large-data workflows and production pipelines.
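As a minimal sketch of the concise syntax and by-reference updates mentioned here, the example below uses the general `DT[i, j, by]` form on a small made-up table. The table `sales` and its column names are hypothetical and chosen only for illustration.

```r
library(data.table)

# Hypothetical toy data; the table and column names are illustrative only.
sales <- data.table(
  region = c("north", "south", "north", "south", "north"),
  units  = c(10L, 4L, 7L, 9L, 3L),
  price  = c(2.5, 3.0, 2.5, 3.0, 2.5)
)

# Update by reference: := adds the column in place, without copying the table.
sales[, revenue := units * price]

# DT[i, j, by]: filter rows (i), compute summaries (j), per group (by).
summary_dt <- sales[units > 3L,
                    .(total_units = sum(units), total_revenue = sum(revenue)),
                    by = region]
print(summary_dt)
```

Because `:=` modifies `sales` in place, no assignment back to `sales` is needed; the grouped summary, by contrast, returns a new data.table.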
Features
- Very fast I/O: fread() for reading delimited files, fwrite() for writing them efficiently
- Extremely efficient grouping / aggregation / summarization; can handle very large datasets (hundreds of millions to billions of rows) in memory, given sufficient RAM
- Fast / flexible joins: non-equi joins, overlapping range joins, ordered joins, joining with aggregations etc.
- In-place (by reference) column creation, updates, deletions to avoid copying large datasets
- Reshaping capabilities: melt / dcast (long ↔ wide), etc.
- Minimal dependencies: relies only on base R; maintained API, active community; good memory efficiency
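The features listed above can be sketched roughly as follows. This is an illustrative example under stated assumptions, not a benchmark: all tables, column names, and the temporary file path are made up for the demonstration, and only `fread()`, `fwrite()`, `melt()`, `dcast()`, and the `on =` join syntax come from data.table itself.

```r
library(data.table)

# 1. Fast I/O: write a small table to a temporary CSV and read it back.
dt <- data.table(id = 1:3, score = c(0.5, 1.5, 2.5))
path <- tempfile(fileext = ".csv")  # hypothetical scratch file
fwrite(dt, path)
dt_back <- fread(path)

# 2. Non-equi join: match each event to the window that contains its time.
events  <- data.table(event = c("a", "b", "c"), time = c(1L, 5L, 9L))
windows <- data.table(label = c("early", "late"),
                      start = c(0L, 6L), end = c(4L, 10L))
# nomatch = NULL drops events that fall in no window (here, event "b").
matched <- windows[events, on = .(start <= time, end >= time), nomatch = NULL]
matched <- matched[, .(event, label)]

# 3. Reshape: wide -> long with melt(), then back to wide with dcast().
wide <- data.table(id = 1:2, q1 = c(10, 20), q2 = c(30, 40))
long <- melt(wide, id.vars = "id",
             variable.name = "quarter", value.name = "amount")
wide_again <- dcast(long, id ~ quarter, value.var = "amount")
```

In the non-equi join, the result's `start` and `end` columns take their values from the joining table's `time` column, which is why the sketch keeps only `event` and `label` afterwards.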
Categories
Package Managers
License
Mozilla Public License 1.0 (MPL)