In-depth tutorials on LLMs, RAGs and real-world AI agent applications
Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI
Large Audio Language Model built for natural interactions
Stable Diffusion web UI
Specification and documentation for Agent Skills
Fast, powerful, git-native ticket tracking in a single bash script
95% token savings. 155x faster queries. 16 languages
Follow along with my AI Agents Masterclass videos
Context engineering is the new vibe coding
Chinese Llama-3 LLMs) developed from Meta Llama 3
Chinese XLNet pre-trained model
Document Image Parsing via Heterogeneous Anchor Prompting”
Framework for building neural networks
StreamSpeech is a seamless model for offline speech recognition
Omnilingual ASR Open-Source Multilingual SpeechRecognition
Implementation of Vision Transformer, a simple way to achieve SOTA
A Model Context Protocol server for searching and analyzing arXiv
4M: Massively Multimodal Masked Modeling
This repository contains the official implementation of FastVLM
Refer and Ground Anything Anywhere at Any Granularity
Utilities intended for use with Llama models
FAIR Sequence Modeling Toolkit 2
Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
ICLR2024 Spotlight: curation/training code, metadata, distribution
A Production-ready Reinforcement Learning AI Agent Library