Open source libraries and APIs to build custom preprocessing pipelines
This repository provides an advanced RAG
Tool for visualizing and tracking your machine learning experiments
The data structure for multimodal data
⚡ Building applications with LLMs through composability ⚡
Phi-3.5 for Mac: Locally-run Vision and Language Models
PPTAgent: Generating and Evaluating Presentations
HunyuanVideo: A Systematic Framework For Large Video Generation Model
The best ChatGPT that $100 can buy
Embed images and sentences into fixed-length vectors
Renderer for the harmony response format to be used with gpt-oss
The official PyTorch implementation of Google's Gemma models
Machine Learning Systems: Design and Implementation
Seamlessly integrate LLMs into scikit-learn
An open sourced end-to-end VLM-based GUI Agent
State-of-the-art diffusion models for image and audio generation
Central interface to connect your LLM's with external data
Multimodal Diffusion with Representation Alignment
Integrate ChatGPT into your own discord bot
CLIP + FFT/DWT/RGB = text to image/video
MII makes low-latency and high-throughput inference possible
The standard data-centric AI package for data quality and ML
A Model Context Protocol server for searching and analyzing arXiv
Refer and Ground Anything Anywhere at Any Granularity
FAIR Sequence Modeling Toolkit 2