This repo contains the code for 1D tokenizer and generator
A Universal Customization Method for Single and Multi Conditioning
Framework for building neural networks
MARS5 speech model (TTS) from CAMB.AI
This repository contains the official implementation of FastVLM
Set of tools to assess and improve LLM security
Chinese and English multimodal conversational language model
Memory-efficient and performant finetuning of Mistral's models
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
Research code artifacts for Code World Model (CWM)
A solution to build and deploy MCP agents and applications
Fast image augmentation library and an easy-to-use wrapper
Play couplet with seq2seq model
95% token savings. 155x faster queries. 16 languages
Qwen3-omni is a natively end-to-end, omni-modal LLM
OCR expert VLM powered by Hunyuan's native multimodal architecture
A backup-first Codex skill for keeping local Codex state fast
A Claude Code plugin that iteratively refines product specifications
A specialized Claude Code workspace for creating long-form
Reflexion: Language Agents with Verbal Reinforcement Learning
Any model. Any hardware. Zero compromise
OCR model for complex documents with layout-aware structured outputs
Multilingual Document Layout Parsing in a Single Vision-Language Model
Agent-ready RPA suite with visual workflow automation tools engine
Structured RAG: ingest, index, query