Memory-efficient and performant finetuning of Mistral's models
A Model Context Protocol server for searching and analyzing arXiv
Refer and Ground Anything Anywhere at Any Granularity
Dataset of GPT-2 outputs for research in detection, biases, and more
CLIP, Predict the most relevant text snippet given an image
Diffusion Transformer with Fine-Grained Chinese Understanding
Machine Learning Systems: Design and Implementation
Towards Real-World Vision-Language Understanding
The best ChatGPT that $100 can buy
The official PyTorch implementation of Google's Gemma models
Official implementation of DreamCraft3D
Clarity in the current fast-paced mess of Open Source innovation
LLM powered fuzzing via OSS-Fuzz
Documentation for Google's Gen AI site - including Gemini API & Gemma
Official python implementation of UTCP. UTCP is an open standard
Official code for Style Aligned Image Generation via Shared Attention
PPTAgent: Generating and Evaluating Presentations
Resources, corpora, and tools for Chinese natural language processing
Implementation of MusicLM music generation model in Pytorch
Repo for external large-scale work
Official PyTorch Implementation of "Scalable Diffusion Models"
High-Resolution Image Synthesis with Latent Diffusion Models
Plug-n-play module turning text-to-image models into animation
Overcoming Data Limitations for High-Quality Video Diffusion Models
A Conversational Speech Generation Model