Hunyuan Translation Model Version 1.5
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Visual Causal Flow
A Systematic Framework for Interactive World Modeling
Industrial-level controllable zero-shot text-to-speech system
Python SDK for Claude Agent
A Unified Framework for Text-to-3D and Image-to-3D Generation
Open-source deep-learning framework
Contexts Optical Compression
Sharp Monocular Metric Depth in Less Than a Second
An experimental version of DeepSeek model
Python bindings for llama.cpp
Qwen3-omni is a natively end-to-end, omni-modal LLM
Audio foundation model excelling in audio understanding
Video Object and Interaction Deletion
Collection of Gemma 3 variants that are trained for performance
Tool for exploring and debugging transformer model behaviors
ChatGLM-6B: An Open Bilingual Dialogue Language Model
PyTorch code and models for the DINOv2 self-supervised learning
GLM-4-Voice | End-to-End Chinese-English Conversational Model
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Provides convenient access to the Anthropic REST API from any Python 3
Generating Immersive, Explorable, and Interactive 3D Worlds
Recovering the Visual Space from Any Views
Multimodal Diffusion with Representation Alignment