Reads official API docs, studies CLI and MCP servers
Open-Sora: Democratizing Efficient Video Production for All
Video-based AI memory library. Store millions of text chunks in MP4
The Batteries-Included Agent that codes like you
Voice Recognition to Text Tool
Miso TTS is an 8 billion, highly emotive text-to-speech model
Codes/Notebooks for AI Projects
Making RAG Simpler with Small and Open-Sourced Language Models
Open source async coding agent that plans, codes, and opens PRs
MII makes low-latency and high-throughput inference possible
Interaction model for connecting buyers to complete purchases
Free OCR Software: No internet required, easy to use.
airda(Air Data Agent
A Conversational Speech Generation Model
Award-winning modern data processing SDK in C++20
TF2 Deep FloorPlan Recognition using a Multi-task Network
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
Implementations of basic RL algorithms with minimal lines of codes
Codes for "Chameleon: Plug-and-Play Compositional Reasoning
Implementation of NÜWA, attention network for text to video synthesis
A python library built to empower developers
Image Restoration Toolbox (PyTorch). Training and testing codes
A simple PyTorch Implementation of Generative Adversarial Networks