ChatGPT extension for scientific research work
A framework to enable multimodal models to operate a computer
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
Open-weight, large-scale hybrid-attention reasoning model
⚡ Building applications with LLMs through composability ⚡
This repo contains the code for 1D tokenizer and generator
Open-source framework for conversational voice AI agents
The data structure for multimodal data
Autonomous LLM agent for end-to-end data science workflows
Multimodal Diffusion with Representation Alignment
Audio foundation model excelling in audio understanding
Open source AI model for generating full songs from lyrics prompts
Biomni: a general-purpose biomedical AI agent
AI-Researcher: Autonomous Scientific Innovation
Multi-modal large language model designed for audio understanding
The official PyTorch implementation of Google's Gemma models
Machine Learning Systems: Design and Implementation
Fast and efficient unstructured data extraction
LM Studio Apple MLX engine
Renderer for the harmony response format to be used with gpt-oss
The standard data-centric AI package for data quality and ML
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm
Agent Skill for generating 2D sprite sheets and map, transparent PNG
AI Slack bot for reading, summarizing, and chatting with content
Official implementation of DreamCraft3D