Foundation model for image generation
Official implementation of Watermark Anything with Localized Messages
A ranked list of awesome machine learning Python libraries
Anthropic's educational courses
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Free, high-quality text-to-speech API endpoint to replace OpenAI
Evaluation and Tracking for LLM Experiments
The official repository for ERNIE 4.5 and ERNIEKit
State-of-the-art (SoTA) text-to-video pre-trained model
A specialized Claude Code workspace for creating long-form
Workshop-Level Automated Scientific Discovery via Agentic Tree Search
Open source platform for managing, testing, and deploying AI apps
Redundancy-aware KV Cache Compression for Reasoning Models
RAG Search API
Block Diffusion for Ultra-Fast Speculative Decoding
A Unified Framework for Text-to-3D and Image-to-3D Generation
Scalable data pre processing and curation toolkit for LLMs
Empowering Code Generation with OSS-Instruct
An Open-source Framework for Data-centric Language Agents
Pretrained time-series foundation model developed by Google Research
Long-form streaming TTS system for multi-speaker dialogue generation
This repository contains the official implementation of FastVLM
Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
Official PyTorch Implementation