Tensor search for humans
Benchmarking synthetic data generation methods
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
MemoryOS is designed to provide a memory operating system
Korea Investment & Securities Open API Github
Visual intelligence for your home.
Open-source industrial-grade ASR models
A frontier, first-principles handbook
End-to-end pipeline converting generative videos
OpenTinker is an RL-as-a-Service infrastructure for foundation models
Motion-controllable Video Generation via Latent Trajectory Guidance
Persistent context and multi-instance coordination
Official implementation of Watermark Anything with Localized Messages
Video understanding codebase from FAIR for reproducing video models
Eva is an A.I. assistant that helps users multi-task.
A Conversational Speech Generation Model
Capable of understanding text, audio, vision, video
Qwen3-omni is a natively end-to-end, omni-modal LLM
A collaboration friendly studio for NeRFs
Llama Chinese community, real-time aggregation
Cosmos-RL is a flexible and scalable Reinforcement Learning framework
DeepMind model for tracking arbitrary points across videos & robotics
Renderer for the harmony response format to be used with gpt-oss
Simplest working implementation of Stylegan2
TensorRT LLM provides users with an easy-to-use Python API