ContextGem: Effortless LLM extraction from documents
A Repo For Document AI
A Python package for extending the official PyTorch
Machine learning metrics for distributed, scalable PyTorch application
machine learning tutorials (mainly in Python3)
LangChain powered shell command generator and runner CLI
A lightweight framework for building LLM-based agents
Enhances Tesseract OCR output using LLMs (local or API)
Multimodal Agents as Smartphone Users, an LLM-based multimodal agent
Open-source deep-learning framework
End-to-End Library for Continual Learning based on PyTorch
Build portable, production-ready MLOps pipelines
AI tool that removes hardcoded subtitles and text from videos locally
Deep learning optimization library: makes distributed training easy
Efficient Triton Kernels for LLM Training
Open source AI model for generating full songs from lyrics prompts
LLM Large Model of Selling Anchor
A course of learning LLM inference serving on Apple Silicon
The repository provides code for running inference with SAM 2
High-Resolution Image Synthesis with Latent Diffusion Models
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Capable of understanding text, audio, vision, video
Large-language-model & vision-language-model based on Linear Attention
An Open Source text-to-speech system built by inverting Whisper
OCR expert VLM powered by Hunyuan's native multimodal architecture