Multi-engine plugin to specify agents with reinforcement learning
Best practices on recommendation systems
This repo contains the code for 1D tokenizer and generator
A Universal Customization Method for Single and Multi Conditioning
Open-source framework for intelligent speech interaction
Reading book source
MARS5 speech model (TTS) from CAMB.AI
This repository provides an advanced RAG
Open Source Differentiable Computer Vision Library
Fast inference engine for Transformer models
Model Context Protocol server that integrates AgentQL's data
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
A trainable PyTorch reproduction of AlphaFold 3
Official Repo For Sa2VA: Marrying SAM2 with LLaVA
LLM-based agent for general purpose software engineering tasks
High-Fidelity and Controllable Generation of Textured 3D Assets
Multi-modal large language model designed for audio understanding
Large Multimodal Models for Video Understanding and Editing
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Deploy and share agents with open infrastructure
An MCP server that autonomously evaluates web applications
The leading agent orchestration platform for Claude
Get started w/ building Fullstack Agents using Gemini 2.5 & LangGraph
Repo of Qwen2-Audio chat & pretrained large audio language model