Official PyTorch Implementation
A framework to enable multimodal models to operate a computer
This repo contains the code for 1D tokenizer and generator
Repo of Qwen2-Audio chat & pretrained large audio language model
Open-source framework for conversational voice AI agents
The data structure for multimodal data
Build AI-powered semantic search applications
Implementation of Video Diffusion Models
Open-source Video Translation Skill
Integrating LLMs into structured NLP pipelines
local-first semantic code search engine
A system for agentic LLM-powered data processing and ETL
Open source AI model for generating full songs from lyrics prompts
Generate music based on natural language prompts using LLMs
InvokeAI is a leading creative engine for Stable Diffusion models
Biomni: a general-purpose biomedical AI agent
AI-Powered Personalized Learning Assistant
The official PyTorch implementation of Google's Gemma models
Machine Learning Systems: Design and Implementation
Seamlessly integrate LLMs into scikit-learn
Renderer for the harmony response format to be used with gpt-oss
Faster and easier training and deployments
Fast and efficient unstructured data extraction
A lightweight framework for building LLM-based agents
Gemma open-weight LLM library, from Google DeepMind