Open source libraries and APIs to build custom preprocessing pipelines
This repository provides an advanced RAG
Tool for visualizing and tracking your machine learning experiments
⚡ Building applications with LLMs through composability ⚡
The data structure for multimodal data
The best ChatGPT that $100 can buy
PPTAgent: Generating and Evaluating Presentations
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Phi-3.5 for Mac: Locally-run Vision and Language Models
Embed images and sentences into fixed-length vectors
Renderer for the harmony response format to be used with gpt-oss
The official PyTorch implementation of Google's Gemma models
Machine Learning Systems: Design and Implementation
Seamlessly integrate LLMs into scikit-learn
An open sourced end-to-end VLM-based GUI Agent
State-of-the-art diffusion models for image and audio generation
Central interface to connect your LLM's with external data
Multimodal Diffusion with Representation Alignment
Integrate ChatGPT into your own discord bot
CLIP + FFT/DWT/RGB = text to image/video
MII makes low-latency and high-throughput inference possible
The standard data-centric AI package for data quality and ML
A Model Context Protocol server for searching and analyzing arXiv
Refer and Ground Anything Anywhere at Any Granularity
FAIR Sequence Modeling Toolkit 2