Deterministic LLMs Outputs for AI Applications and AI Agents
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
textgen, Text Generation models
Dataset of GPT-2 outputs for research in detection, biases, and more
CLIP, Predict the most relevant text snippet given an image
Create videos with Stable Diffusion
CLIP + FFT/DWT/RGB = text to image/video
Chat & pretrained large audio language model proposed by Alibaba Cloud
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences
ImageBind One Embedding Space to Bind Them All
Diffusion Transformer with Fine-Grained Chinese Understanding
Tensor search for humans
Machine Learning Systems: Design and Implementation
Pretrained model hub for Keras 3
Open source libraries and APIs to build custom preprocessing pipelines
CogView4, CogView3-Plus and CogView3(ECCV 2024)
Towards Real-World Vision-Language Understanding
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Low-latency REST API for serving text-embeddings
The data structure for multimodal data
Gemma open-weight LLM library, from Google DeepMind
Aider is AI pair programming in your terminal
The official PyTorch implementation of Google's Gemma models