GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Recovering the Visual Space from Any Views
CodeGeeX2: A More Powerful Multilingual Code Generation Model
CogView4, CogView3-Plus and CogView3(ECCV 2024)
Lets make video diffusion practical
An experimental version of DeepSeek model
Programmatic access to the AlphaGenome model
Controllable & emotion-expressive zero-shot TTS
Open-source multi-speaker long-form text-to-speech model
Pokee Deep Research Model Open Source Repo
An Efficient Agentic Model for Computer Use
GLM-4-Voice | End-to-End Chinese-English Conversational Model
A Customizable Image-to-Video Model based on HunyuanVideo
A Systematic Framework for Interactive World Modeling
Open Source Speech Language Model
Open-source industrial-grade ASR models
Ling is a MoE LLM provided and open-sourced by InclusionAI
Multimodal-Driven Architecture for Customized Video Generation
Repo for SeedVR2 & SeedVR
Video Object and Interaction Deletion
Audio foundation model excelling in audio understanding
Ultra-Efficient LLMs on End Device
A series of math-specific large language models of our Qwen2 series
Inference code for scalable emulation of protein equilibrium ensembles
Generating Immersive, Explorable, and Interactive 3D Worlds