Stable Diffusion web UI
Code for the paper Language Models are Unsupervised Multitask Learners
Chat with your SQL database
High-Resolution Image Synthesis with Latent Diffusion Models
Interact with your SQL database, Natural Language to SQL using LLMs
Contexts Optical Compression
Research code artifacts for Code World Model (CWM)
A Model Context Protocol (MCP) server
Release for Improved Denoising Diffusion Probabilistic Models
gpt-4o for windows, macos and linux
Omnilingual ASR Open-Source Multilingual SpeechRecognition
Clone a voice in 5 seconds to generate arbitrary speech in real-time
A unified interface for distributed computing
Solve end to end problems using Llama model family
Parse files for optimal RAG
A Customizable Image-to-Video Model based on HunyuanVideo
Dataset of GPT-2 outputs for research in detection, biases, and more
Making Enterprise Data Intelligent and Responsive for AI
Ready-to-use OCR with 80+ supported languages
The largest collection of PyTorch image encoders / backbones
Reverse-engineered Python API for Google Gemini web app
Advanced techniques for RAG systems
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
A neural network that transforms a design mock-up into static websites
A Model Context Protocol (MCP) server implementation