Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Unified KV Cache Compression Methods for Auto-Regressive Models
text and image to video generation: CogVideoX (2024) and CogVideo
Renderer for the harmony response format to be used with gpt-oss
Image polygonal annotation with Python
A community-supported supercharged version of paperless
OCRmyPDF adds an OCR text layer to scanned PDF files
Robust Speech Recognition via Large-Scale Weak Supervision
Open source platform for the machine learning lifecycle
Specification and documentation for Agent Skills
The machine learning toolkit for time series analysis in Python
Public repository for Agent Skills
Adding guardrails to large language models
Python tool for converting files and office documents to Markdown
Hub of ready-to-use datasets for ML models
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Trainable models and NN optimization tools
A text-to-speech, speech-to-text and speech-to-speech library
Free, high-quality text-to-speech API endpoint to replace OpenAI
Petastorm library enables single machine or distributed training
This repository is a curated collection of links to various courses
Paste Markdown and AI responses into Word Excel instantly fast
Core ML tools contain supporting tools for Core ML model conversion
Unofficial Python API and agentic skill for Google NotebookLM
Generate audiobooks from EPUBs, PDFs and text with captions