Multi-Agent daTa geneRation Infra and eXperimentation framework
An Open Source text-to-speech system built by inverting Whisper
Implementation of Vision Transformer, a simple way to achieve SOTA
Concatenate a directory full of files into a single prompt
4M: Massively Multimodal Masked Modeling
Guiding Instruction-based Image Editing via Multimodal Large Language
MCP integration platforms for AI agents to use tools at any scale
Audiocraft is a library for audio processing and generation
Official implementation of DreamCraft3D
Large Multimodal Models for Video Understanding and Editing
Fundamentals of Machine Learning and Deep Learning
LLM-powered fuzzing via OSS-Fuzz
PPTAgent: Generating and Evaluating Presentations
Implementation of "MobileCLIP" CVPR 2024
mcp-language-server gives MCP-enabled clients access to semantic tools
A TTS model capable of generating ultra-realistic dialogue
Code for the "Language models can explain neurons in language models" paper
The official PyTorch implementation of Google's Gemma models
Provides CTP stock options and Zhongtai Securities XTP
Towards Ultimate Expert Specialization in Mixture-of-Experts Language
PyTorch code and models for VJEPA2 self-supervised learning from video
The repository provides code for running inference with SAM 2
The ChatGPT Retrieval Plugin lets you easily find personal documents
Collection of reference environments, offline reinforcement learning
Inference script for Oasis 500M