High-Fidelity and Controllable Generation of Textured 3D Assets
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
Open Source Speech Language Model
Open-source industrial-grade ASR models
Hunyuan Translation Model Version 1.5
Block Diffusion for Ultra-Fast Speculative Decoding
Implementation of "MobileCLIP" CVPR 2024
High-resolution models for human tasks
Ling is a MoE LLM provided and open-sourced by InclusionAI
Genome modeling and design across all domains of life
Pretrained time-series foundation model developed by Google Research
Inference script for Oasis 500M
Generate Any 3D Scene in Seconds
Fast and Universal 3D reconstruction model for versatile tasks
Memory-efficient and performant finetuning of Mistral's models
ChatGPT interface with better UI
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
Unified Multimodal Understanding and Generation Models
DeepMind model for tracking arbitrary points across videos & robotics
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
OCR expert VLM powered by Hunyuan's native multimodal architecture
High-Resolution Image Synthesis with Latent Diffusion Models
AI-powered tool to quickly remove watermarks from images flawlessly
StudioOllamaUI is a local, portable interface for Ollama
Official DeiT repository