Circuit diagrams and firmware source code for Gboard DIY keyboards
Instant voice cloning by MIT and MyShell. Audio foundation model
The official repo of Qwen chat & pretrained large language model
AutoGluon: AutoML for Image, Text, and Tabular Data
A lightweight approach to removing Google web service dependency
Deep Research framework, combining language models with tools
ChatGPT extension for scientific research work
Repo of Qwen2-Audio chat & pretrained large audio language model
21 Lessons, Get Started Building with Generative AI
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Diffusion Transformer with Fine-Grained Chinese Understanding
Large Multimodal Models for Video Understanding and Editing
Check links in web documents or full websites
An open source implementation of CLIP
Official PyTorch Implementation
A TTS model capable of generating ultra-realistic dialogue
Visual Causal Flow
95% token savings. 155x faster queries. 16 languages
A Coverage-Guided, Native Python Fuzzer
LaTeX source and supporting code for Think Python, 2nd edition
Extract one time password (OTP) secrets from QR codes
Powerful open source team chat application
Flexible Photo Recrafting While Preserving Your Identity
Concatenate a directory full of files into a single prompt