Benchmark LLMs by fighting in Street Fighter 3
Synthetic data generators for structured and unstructured text
Open source AI model for generating full songs from lyrics prompts
LLM Large Model of Selling Anchor
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
Renderer for the harmony response format to be used with gpt-oss
This repo contains the code for 1D tokenizer and generator
The data structure for multimodal data
Windrecorder is a memory search app by records everything
Audio foundation model excelling in audio understanding
Multimodal Diffusion with Representation Alignment
Web based localization tool with tight version control integration
Autonomous LLM agent for end-to-end data science workflows
An open sourced end-to-end VLM-based GUI Agent
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm
⚡ Building applications with LLMs through composability ⚡
Biomni: a general-purpose biomedical AI agent
OpenRecall is a fully open-source, privacy-first alternative
AI-Researcher: Autonomous Scientific Innovation
Towards Studio-Grade Character Animation via In-Context Learning of 3D
The official PyTorch implementation of Google's Gemma models
Machine Learning Systems: Design and Implementation
Multi-modal large language model designed for audio understanding
The standard data-centric AI package for data quality and ML
AI Slack bot for reading, summarizing, and chatting with content