Advancing Open-source World Models
Powerful tool that lets you create and run intelligent agents
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
An AI personal assistant for your digital brain
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine
Python inference and LoRA trainer package for the LTX-2 audio–video
AIHawk aims to easy job hunt process by automating job applications
Reverse-engineered Python API for Google Gemini web app
AI Toolkit for Healthcare Imaging
AI-powered video clipping and highlight generation
SOTA Open Source TTS
State-of-the-art TTS model under 25MB
A Domain-Fronting Relay that routes traffic though GAS
Text and image to video generation: CogVideoX and CogVideo
A frontier, first-principles handbook
A command-line productivity tool powered by AI large language models
A lightweight audio-to-MIDI converter with pitch bend detection
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Director, Screenwriter, Producer, and Video Generator All-in-One
Synchronized Translation for Videos
Python bindings for llama.cpp
Letta (formerly MemGPT) is a framework for creating LLM services
High-Quality Voice Cloning TTS for 600+ Languages
Instant voice cloning by MIT and MyShell. Audio foundation model
Interact with your documents using the power of GPT