An easy-to-use & supercharged open-source experiment tracker
Generate audiobooks from e-books, voice cloning & 1107+ languages
Python binding to the Apache Tika™ REST services
TFX is an end-to-end platform for deploying production ML pipelines
Machine Learning Systems: Design and Implementation
Open Source Document Management System for Digital Archives
A high-quality tool for convert PDF to Markdown and JSON
Public repository for Agent Skills
Specification and documentation for Agent Skills
Helps data scientists define testable self-documenting dataflows
Your Personal Research Multi-Tool
Skills Catalog for Codex
A Model Context Protocol server for searching and analyzing arXiv
EPUB to audiobook converter, optimized for Audiobookshelf
Git-based data version control for machine learning workflows
When LLM Meets Domain Experts
Uncommon Objects in 3D dataset
Evals is a framework for evaluating LLMs and LLM systems
A Repo For Document AI
Reading book source
Generate audiobooks from EPUBs, PDFs and text with captions
ContextGem: Effortless LLM extraction from documents
OCR model for complex documents with layout-aware structured outputs
A robust, efficient, low-latency speech-to-text library
AI video agents framework for next-gen video interactions