A SOTA open-source image editing model
Repo of Qwen2-Audio chat & pretrained large audio language model
Expose your FastAPI endpoints as Model Context Protocol (MCP) tools
Build Vision Agents quickly with any model or video provider
The open source post-building layer for agents
Accessible large language models via k-bit quantization for PyTorch
Document content and metadata extraction microservice
A PyTorch-based Speech Toolkit
Self-evolving AI agent framework for automated workflows
Sunfish: a Python Chess Engine in 111 lines of code
Parallax is a distributed model serving framework
Maimaibot, a (more focused) multi-platform intelligent agent
LLM
Chat with your documents using local AI
Multimodal Agents as Smartphone Users, an LLM-based multimodal agent
Multi-modal large language model designed for audio understanding
Open-source framework for intelligent speech interaction
Open Source Differentiable Computer Vision Library
AI discovers 520000 stable inorganic crystal structures for research
LLM training code for MosaicML foundation models
SimpleMem: Efficient Lifelong Memory for LLM Agents
A Model Context Protocol server for searching and analyzing arXiv
Open-source platform for building enterprise-grade agents
Open-weight, large-scale hybrid-attention reasoning model
Large-language-model & vision-language-model based on Linear Attention