This package represents a community effort to provide a common interface for accessing common Machine Learning (ML) datasets. In contrast to other data-related Julia packages, the focus of MLDatasets.jl is specifically on downloading, unpacking, and accessing benchmark datasets. Functionality for the purpose of data processing or visualization is only provided to a degree that is special to some datasets.

Features

  • Datasets are grouped into different categories
  • The way MLDatasets.jl is organized is that each dataset is its own type
  • Datasets with an underlying graph structure: Cora, PubMed, CiteSeer
  • Datasets that do not fall into any of the other categories: Iris, BostonHousing
  • Datasets for language models
  • Vision related datasets such as MNIST, CIFAR10, CIFAR100

Project Samples

Project Activity

See All Activity >

Categories

Machine Learning

License

MIT License

Follow MLDatasets.jl

MLDatasets.jl Web Site

Other Useful Business Software
Try Google Cloud Risk-Free With $300 in Credit Icon
Try Google Cloud Risk-Free With $300 in Credit

No hidden charges. No surprise bills. Cancel anytime.

Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
Start Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of MLDatasets.jl!

Additional Project Details

Programming Language

Julia

Related Categories

Julia Machine Learning Software

Registered

2023-11-17