CLIP, Predict the most relevant text snippet given an image
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster
Dough is a open source tool for steering AI animations with precision
Gorilla: An API store for LLMs
Turn your website into a GIF
Implementation of Phenaki Video, which uses Mask GIT
Time series forecasting with PyTorch
Implementation of Imagen, Google's Text-to-Image Neural Network
A framework to enable multimodal models to operate a computer
Free, high-quality text-to-speech API endpoint to replace OpenAI
Open source framework for deep learning satellite and aerial imagery
One API call, pull Claude agent, completely sandboxed
slime is an LLM post-training framework for RL Scaling
95% token savings. 155x faster queries. 16 languages
Chinese XLNet pre-trained model
Context data platform for building observable, self-learning AI agents
AI discovers 520000 stable inorganic crystal structures for research
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Generating Immersive, Explorable, and Interactive 3D Worlds
The Memory layer for AI Agents
State-of-the-art diffusion models for image and audio generation
Implementation of Video Diffusion Models
A library for scientific machine learning & physics-informed learning
AutoGluon: AutoML for Image, Text, and Tabular Data