OCR model for complex documents with layout-aware structured outputs
State-of-the-art TTS model under 25MB
Chemcrow
Real-World Centric Foundation GUI Agents
FlashInfer: Kernel Library for LLM Serving
A lightweight, powerful framework for multi-agent workflows
Wraps all package managers with a unifying CLI
Fast and memory-efficient exact attention
A simple tool for reading in poorly redacted documents
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Fake Protocol Server
Bitcart is a platform for merchants, users and developers
A generative speech model for daily dialogue
An open and fair framework for everyone to build AI agents
High-Resolution Image Synthesis with Latent Diffusion Models
AI Agent Networks for Open Collaboration
Automated framework for asset discovery and vulnerability scanning
The first AI agent that builds permissionless integrations
Using AI models to automatically provide commentary and edit videos
Parameterize, execute, and analyze notebooks
Harmonized and Coherent Human Image Animation
Fast-stable-diffusion + DreamBooth
A tool to use the Ai2 Open Coding Agents Soft-Verified Agents
Block Diffusion for Ultra-Fast Speculative Decoding
Socket.IO integration for Flask applications