Code to accompany "A Method for Animating Children's Drawings"
Official implementation of DreamCraft3D
A simple screen parsing tool towards pure vision based GUI agent
AI-powered tool for developers, simplifying coding tasks
A library for accelerating Transformer models on NVIDIA GPUs
Library for OCR-related tasks powered by Deep Learning
High-level training, data augmentation, and utilities for Pytorch
An Open Source text-to-speech system built by inverting Whisper
On-device Speech-to-Intent engine powered by deep learning
Local Lambda debug, CodeWhisperer, SAM/CFN syntax, etc.
Model Context Protocol server that integrates AgentQL's data
Generate blog articles from video or audio
Controllable and fast Text-to-Speech for over 7000 languages
Tooling for the Common Objects In 3D dataset
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
Inference Llama 2 in one file of pure C
AI-Powered Wiki Generator for GitHub/Gitlab/Bitbucket Repositories
Making RAG Simpler with Small and Open-Sourced Language Models
Marrying Grounding DINO with Segment Anything & Stable Diffusion
Ultimate meta-skill for generating best-in-class Claude Code skills
A New Axis of Sparsity for Large Language Models
Context engineering is the new vibe coding
"Big Model" trains a visual multimodal VLM with 26M parameters
AI Agent Networks for Open Collaboration
LLM based autonomous agent that does online comprehensive research