A Model Context Protocol server for searching and analyzing arXiv
Refer and Ground Anything Anywhere at Any Granularity
Clarity in the current fast-paced mess of Open Source innovation
Deterministic LLM Outputs for AI Applications and AI Agents
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
textgen, Text Generation models
Open-source all-in-one platform for engineering AI products
Dataset of GPT-2 outputs for research in detection, biases, and more
CLIP, Predict the most relevant text snippet given an image
Create videos with Stable Diffusion
CLIP + FFT/DWT/RGB = text to image/video
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences
Machine Learning Systems: Design and Implementation
Pretrained model hub for Keras 3
Open source libraries and APIs to build custom preprocessing pipelines
CogView4, CogView3-Plus and CogView3 (ECCV 2024)
Towards Real-World Vision-Language Understanding
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Low-latency REST API for serving text embeddings
The data structure for multimodal data
Gemma open-weight LLM library, from Google DeepMind
Aider is AI pair programming in your terminal
The official PyTorch implementation of Google's Gemma models