State-of-the-art Image & Video CLIP, Multimodal Large Language Models
DeepSeek Coder: Let the Code Write Itself
kaldi-asr/kaldi is the official location of the Kaldi project
A lightweight, powerful framework for multi-agent workflows
AI Toolkit for Healthcare Imaging
The most intuitive, flexible, way for researchers to build models
A robust, efficient, low-latency speech-to-text library
A full spaCy pipeline and models for scientific/biomedical documents
Generate high-definition story short videos with one click using AI
Converts text to speech in realtime
Get started w/ building Fullstack Agents using Gemini 2.5 & LangGraph
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Optimizing inference proxy for LLMs
TextWorld is a sandbox learning environment for the training
Full stack AI software engineer
Official MiniMax Model Context Protocol (MCP) server
Solve end to end problems using Llama model family
Easiest and laziest way for building multi-agent LLMs applications
Conversational voice AI agents
Benchmarking Multimodal Agents for Open-Ended Tasks
MiniSom is a minimalistic implementation of the Self Organizing Maps
Airweave lets agents search any app
Parse files for optimal RAG
An Async Bot/API wrapper for Twitch made in Python
AI-data warehouse to enrich, transform and analyze unstructured data