Inference script for Oasis 500M
Document Image Parsing via Heterogeneous Anchor Prompting”
Framework for building neural networks
StreamSpeech is a seamless model for offline speech recognition
Toolkit for audio, music, and speech generation
Advanced techniques for RAG systems
The best ChatGPT that $100 can buy
A secure sandbox environment for malware developers and red teamers
A Model Context Protocol server for searching and analyzing arXiv
4M: Massively Multimodal Masked Modeling
Guiding Instruction-based Image Editing via Multimodal Large Language
Refer and Ground Anything Anywhere at Any Granularity
A Model Context Protocol (MCP) Gateway & Registry
The official Meta Llama 3 GitHub site
Utilities intended for use with Llama models
Open-source platform for building enterprise-grade agents
FAIR Sequence Modeling Toolkit 2
ICLR2024 Spotlight: curation/training code, metadata, distribution
Official DeiT repository
Provides code for running inference with the SegmentAnything Model
Anthropic's Interactive Prompt Engineering Tutorial
Anthropic's educational courses
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
A Customizable Image-to-Video Model based on HunyuanVideo