Unleashing 10,000+ Word Generation from Long Context LLMs
Stable Diffusion web UI
A Systematic Framework for Interactive World Modeling
One-click deployment (including offline integration package)
Unified Multimodal Understanding and Generation Models
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Open-source choice to scale, assess and maintain natural language data
Context database designed specifically for AI Agents
Models for the spaCy Natural Language Processing (NLP) library
NLTK Source
A single Gradio + React WebUI with extensions for ACE-Step
Provides CTP stock options and Zhongtai Securities XTP
A python tool that uses GPT-4, FFmpeg, and OpenCV
PPTAgent: Generating and Evaluating Presentations
Dealing with all unstructured data, such as reverse image search
SDK for building interactive UI components over MCP for AI tools
AI-Powered Data Processing: Use LOTUS to process all of your datasets
Seamlessly integrate LLMs into scikit-learn
Practical productivity tools for Claude Code, Codex-CLI
Flexible Photo Recrafting While Preserving Your Identity
MARS5 speech model (TTS) from CAMB.AI
Tensor search for humans
Offical Implementation for "Recursive Multi-Agent Systems"
Repository containing notebooks of my posts on Medium
A New Axis of Sparsity for Large Language Models