This repository hosts the Omniglot dataset for one-shot learning, containing handwritten characters across multiple alphabets along with stroke data. It includes both MATLAB and Python starter scripts (e.g. demo.m, demo.py) to illustrate how to load the images and stroke sequences and run baseline experiments (such as classification by modified Hausdorff distance). The dataset provides both an image representation of each character and the time-ordered stroke coordinates ([x, y, t]) for each instance. Includes stroke data (time-sequenced coordinates) per sample. The repository is intended as a benchmark dataset in few-shot / meta-learning research, not as a plug-and-play detection or classification engine. Pre-split “background” and “evaluation” alphabets for standard benchmarking.

Features

  • Contains 1,623 characters drawn by 20 different people each
  • Includes stroke data (time-sequenced coordinates) per sample
  • Supplies MATLAB and Python demo scripts for usage
  • Pre-split “background” and “evaluation” alphabets for standard benchmarking
  • Support for “minimal” splits with fewer background alphabets
  • Easily extensible / usable as a benchmark dataset for one-shot methods

Project Samples

Project Activity

See All Activity >

Categories

Algorithms

License

MIT License

Follow Omniglot

Omniglot Web Site

Other Useful Business Software
Go from Code to Production URL in Seconds Icon
Go from Code to Production URL in Seconds

Cloud Run deploys apps in any language instantly. Scales to zero. Pay only when code runs.

Skip the Kubernetes configs. Cloud Run handles HTTPS, scaling, and infrastructure automatically. Two million requests free per month.
Try it free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Omniglot!

Additional Project Details

Operating Systems

Linux, Mac, Windows

Programming Language

MATLAB

Related Categories

MATLAB Algorithms

Registered

2025-09-29