LTX-Video Support for ComfyUI
When LLM Meets Domain Experts
PyTorch code and models for the DINOv2 self-supervised learning
LLM-based agent for general purpose software engineering tasks
Laravel-focused MCP server for augmenting AI powered local development
tiktoken is a fast BPE tokeniser for use with OpenAI's models
"Big Model" trains a visual multimodal VLM with 26M parameters
An experimental version of DeepSeek model
A Powerful Native Multimodal Model for Image Generation
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
Diversity-driven optimization and large-model reasoning ability
The official Meta Llama 3 GitHub site
Get a ChatGPT plugin up and running in under 5 minutes
Diffusion Transformer with Fine-Grained Chinese Understanding
Provider-agnostic, open-source evaluation infrastructure
Documentation for Google's Gen AI site - including Gemini API & Gemma
Offline inference engine for art, real-time voice conversations
PyTorch code and models for V-JEPA self-supervised learning from video
Code to accompany "A Method for Animating Children's Drawings"
Collection of Gemma 3 variants that are trained for performance
Sample code and notebooks for Generative AI on Google Cloud
A simple, secure MCP-to-OpenAPI proxy server
A fast, powerful, and simple hierarchical vision transformer
Code release for Cut and Learn for Unsupervised Object Detection
CLIP, Predict the most relevant text snippet given an image