A single Gradio + React WebUI with extensions for ACE-Step
Claude Code action for GitHub PRs
Renderer for the harmony response format to be used with gpt-oss
A Systematic Framework for Interactive World Modeling
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
Dramatron uses large language models to generate coherent scripts
AI code reviews, just like your senior dev would do
Reverse-engineered Python API for Google Gemini web app
GUI/CLI tool for downloading Xiaohongshu
A robust, efficient, low-latency speech-to-text library
Towards self-verifiable mathematical reasoning
Code for the paper Language Models are Unsupervised Multitask Learners
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
Code to accompany "A Method for Animating Children's Drawings"
DeepSeek LLM: Let there be answers
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Open source text-to-speech tool, supports extra-long text
Open source full-stack AI vibe coding platform & web app generator
Instant AI code reviews
Python SDK for Claude Agent
The official Python library for the OpenAI API
From Images to High-Fidelity 3D Assets
Multimodal model achieving SOTA performance
Documentation for Google's Gen AI site - including Gemini API & Gemma
PyTorch3D is FAIR's library of reusable components for deep learning