Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Chemcrow
Make websites accessible for AI agents
LTX-Video Support for ComfyUI
Generate audiobooks from EPUBs, PDFs and text with captions
A Systematic Framework for Interactive World Modeling
Adds support for Yandex Smart Home (Alice voice assistant)
AI Toolkit for Healthcare Imaging
Official inference repo for FLUX.1 models
Minimal CLI coding agent by Mistral
Offline Text To Speech synthesis for python
Models for the spaCy Natural Language Processing (NLP) library
A lightweight audio-to-MIDI converter with pitch bend detection
Open-source multi-speaker long-form text-to-speech model
Industrial-level controllable zero-shot text-to-speech system
Contexts Optical Compression
Conversational voice AI agents
DSPy: The framework for programming—not prompting—language models
Open-source autonomous AI software engineer
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine
Code for the paper Language Models are Unsupervised Multitask Learners
The official gpt4free repository
lightweight package to simplify LLM API calls
The common language for platforms, agents and businesses.
Renderer for the harmony response format to be used with gpt-oss