Fast and efficient unstructured data extraction
LM Studio Apple MLX engine
AI Slack bot for reading, summarizing, and chatting with content
Official implementation of DreamCraft3D
Papers from the computer science community to read and discuss
Multilingual Document Layout Parsing in a Single Vision-Language Model
Shared repository for open-sourced projects from the Google AI Lang
Build multimodal AI applications with cloud-native stack
The official implementation of RAPTOR
Ready-to-run cloud templates for RAG
Document Index for Vectorless, Reasoning-based RAG
Open-Source Dual-Arm Mobile Robot with Motorized Lift
Block Diffusion for Ultra-Fast Speculative Decoding
Generate Canvas, Excalidraw, and Mermaid diagrams from text
Korvus is a search SDK that unifies the entire RAG pipeline
A Personalized LLM-powered Agent Frameworks
Faster and easier training and deployments
Play couplet with seq2seq model
Running large language models on a single GPU
LongBench v2 and LongBench (ACL 25'&24')
Traditional Mandarin LLMs for Taiwan
LISA: Reasoning Segmentation via Large Language Model
Skywork-R1V is an advanced multimodal AI model series
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Self-learning data agent that grounds its answers in layers of content