Multi-Agent daTa geneRation Infra and eXperimentation framework
An Open Source text-to-speech system built by inverting Whisper
Concatenate a directory full of files into a single prompt
Implementation of Vision Transformer, a simple way to achieve SOTA
4M: Massively Multimodal Masked Modeling
Guiding Instruction-based Image Editing via Multimodal Large Language
Audiocraft is a library for audio processing and generation
Official implementation of DreamCraft3D
Large Multimodal Models for Video Understanding and Editing
Fundamentals of Machine Learning and Deep Learning
LLM-powered fuzzing via OSS-Fuzz
PPTAgent: Generating and Evaluating Presentations
Implementation of "MobileCLIP" CVPR 2024
The official PyTorch implementation of Google's Gemma models
A TTS model capable of generating ultra-realistic dialogue
Code for the "Language models can explain neurons in language models" paper
Provides CTP stock options and Zhongtai Securities XTP
mcp-language-server gives MCP-enabled clients access to semantic tools
Towards Ultimate Expert Specialization in Mixture-of-Experts Language
Collection of reference environments, offline reinforcement learning
PyTorch code and models for VJEPA2 self-supervised learning from video
The repository provides code for running inference with SAM 2
The ChatGPT Retrieval Plugin lets you easily find personal documents
Inference script for Oasis 500M
Framework for building neural networks