Real-World Centric Foundation GUI Agents
SOTA discrete acoustic codec models with 40/75 tokens per second
Windrecorder is a memory search app by records everything
A framework to enable multimodal models to operate a computer
Web based localization tool with tight version control integration
Open source AI model for generating full songs from lyrics prompts
LLM Large Model of Selling Anchor
The AI framework that adds the engineering to prompt engineering
State-of-the-art diffusion models for image and audio generation
Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy
API Documentation Browser
Apache Lucene open-source search software
ONNX-TensorRT: TensorRT backend for ONNX
Visualize and compare datasets, target values and associations
Create beautiful slides on the web using Claude's frontend skills
Synthetic data generators for structured and unstructured text
ChatGPT extension for scientific research work
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
Open-weight, large-scale hybrid-attention reasoning model
This repo contains the code for 1D tokenizer and generator
Open-source framework for conversational voice AI agents
The data structure for multimodal data
A Fancy and Fast Emacs Configuration
Autonomous LLM agent for end-to-end data science workflows
Block Diffusion for Ultra-Fast Speculative Decoding