A lightweight text-to-speech model with zero-shot voice cloning
95% token savings. 155x faster queries. 16 languages
A TTS that fits in your CPU (and pocket)
Inference script for Oasis 500M
Framework for building neural networks
StreamSpeech is a seamless model for offline speech recognition
Toolkit for audio, music, and speech generation
Build GenAI application quick and easy
Advanced techniques for RAG systems
Fast and Universal 3D reconstruction model for versatile tasks
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Real-time behaviour synthesis with MuJoCo, using Predictive Control
A secure sandbox environment for malware developers and red teamers
A Model Context Protocol server for searching and analyzing arXiv
4M: Massively Multimodal Masked Modeling
Guiding Instruction-based Image Editing via Multimodal Large Language
This repository contains the official implementation of FastVLM
Refer and Ground Anything Anywhere at Any Granularity
The official Meta Llama 3 GitHub site
Inference code for CodeLlama models
Utilities intended for use with Llama models
Set of tools to assess and improve LLM security
Open-source platform for building enterprise-grade agents
FAIR Sequence Modeling Toolkit 2
ICLR2024 Spotlight: curation/training code, metadata, distribution