4M: Massively Multimodal Masked Modeling
Guiding Instruction-based Image Editing via Multimodal Large Language
Agent toolkit providing semantic retrieval and editing capabilities
PyTorch code and models for V-JEPA self-supervised learning from video
PyTorch code and models for the DINOv2 self-supervised learning
Towards Ultimate Expert Specialization in Mixture-of-Experts Language
A batteries-included library for building AI-powered software
The Unified Machine Learning Framework
Repo of Qwen2-Audio chat & pretrained large audio language model
Open Source Document Management System for Digital Archives
PPTAgent: Generating and Evaluating Presentations
Get a ChatGPT plugin up and running in under 5 minutes
General proxy performance testing tool based on Clash using Telegram
Composio equips your AI agents & LLMs
PyTorch version of Stable Baselines
Implementation of Denoising Diffusion Probabilistic Model in PyTorch
An open source implementation of CLIP
A refreshing functional take on deep learning
Unified Model Serving Framework
Audiocraft is a library for audio processing and generation
ktrain is a Python library that makes deep learning and AI more accessible
Simple, unified interface to multiple Generative AI providers
A high-performance ML model serving framework, offers dynamic batching
Multilingual sentence & image embeddings with BERT
Set of tools to assess and improve LLM security