A lightweight audio-to-MIDI converter with pitch bend detection
Label Studio is a multi-type data labeling and annotation tool
Build AI-powered semantic search applications
Infrastructure for AI code interpreting that's powering E2B
An Open-Source Programming Framework for Agentic AI
lightweight package to simplify LLM API calls
PArallel Distributed Deep LEarning: Machine Learning Framework
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Open-source, code-first Python toolkit for building, evaluating, etc.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
A single Gradio + React WebUI with extensions for ACE-Step
Documentation for Google's Gen AI site - including Gemini API & Gemma
GitHub's official MCP Server
ChatMCP is an AI chat client implementing the Model Context Protocol
DSPy: The framework for programming—not prompting—language models
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine
Bring the notion of Model-as-a-Service to life
Open-source multi-speaker long-form text-to-speech model
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
A command-line interface for interacting with MCP
FastGPT is a knowledge-based platform built on the LLMs
OCR offline image text recognition command line windows program
Collection of Gemma 3 variants that are trained for performance
LTX-Video Support for ComfyUI
Taming Stable Diffusion for Lip Sync