Qwen3-omni is a natively end-to-end, omni-modal LLM
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Serving LangChain LLM apps automagically with FastApi
Generate 3D objects conditioned on text or images
Run AI-powered workflows over your codebase
CoTracker is a model for tracking any point (pixel) on a video
README file generator, powered by AI
textgen, Text Generation models
Evaluation code for various unsupervised automated metrics
CLIP + FFT/DWT/RGB = text to image/video
Code repo for "WebArena to build Autonomous Agents
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis
The unified and scalable ML library for large-scale training
Label, clean and enrich text datasets with LLMs
On-device Speech-to-Intent engine powered by deep learning
Serve machine learning models within a Docker container
Open Source Computer Vision Library
Clarity in the current fast-paced mess of Open Source innovation
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
human detection using yolov8
FAIR's research platform for object detection research
Fast Python collaborative filtering for implicit feedback datasets
Plug-n-play module turning text-to-image models into animation
LLMFlows - Simple, Explicit and Transparent LLM Apps
The PyTorch-based audio source separation toolkit for researchers