Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Ongoing research training transformer models at scale
Video-based AI memory library. Store millions of text chunks in MP4
Anti-Detect Browser that passes every bot detection test
Build production-ready AI agents in both Python and TypeScript
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Implementation of the Surya Foundation Model for Heliophysics
An easy-to-use & supercharged open-source experiment tracker
Easy-to-use and powerful NLP library with an awesome model zoo
Agent Framework / shim to use Pydantic with LLMs
AI agents running research on single-GPU nanochat training
A new DSL and server for AI agents and multi-step tasks
Simple package for monitoring and controlling your NVIDIA Jetson
A 0.1B Omni model trained from scratch
A Claude Code plugin that iteratively refines product specifications
All-in-one native macOS AI chat application
Enterprise multi-agent orchestration framework for scalable AI apps
Open Source Speech Language Model
Build multimodal AI applications with cloud-native stack
Specify a GitHub or local repo, GitHub pull request
AI memory OS for LLM and Agent systems
High-resolution models for human tasks
A Unified Framework for Text-to-3D and Image-to-3D Generation
Multimodal-Driven Architecture for Customized Video Generation
Multimodal Diffusion with Representation Alignment