Implementation of Video Diffusion Models
Open-source Video Translation Skill
Context database designed specifically for AI Agents
Gemma open-weight LLM library, from Google DeepMind
Extract schema, statistics and entities from datasets
A lightweight framework for building LLM-based agents
LISA: Reasoning Segmentation via Large Language Model
Build a large language model from scratch with only a Python foundation
Integrating LLMs into structured NLP pipelines
A Frontier Mathematical Coding Agent
Biomni: a general-purpose biomedical AI agent
AI-Powered Personalized Learning Assistant
AI-Researcher: Autonomous Scientific Innovation
The official PyTorch implementation of Google's Gemma models
Machine Learning Systems: Design and Implementation
Adding guardrails to large language models
Seamlessly integrate LLMs into scikit-learn
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Automate native Android apps with AI using accessibility APIs
Renderer for the harmony response format to be used with gpt-oss
State-of-the-art diffusion models for image and audio generation
A refreshing functional take on deep learning
Local-first semantic code search engine
Implementation of Make-A-Video, a new SOTA text-to-video generator
Multilingual Document Layout Parsing in a Single Vision-Language Model