An on-premises, OCR-free unstructured data extraction
Hackable and optimized Transformers building blocks
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Your open-source LLM evaluation toolkit
code for Mesh R-CNN, ICCV 2019
Reflexion: Language Agents with Verbal Reinforcement Learning
Build production-ready AI agents in both Python and Typescript
A general fine-tuning kit geared toward image/video/audio diffusion
Automatic question answering for local knowledge bases based on LLM
Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI
Foundation Models for Time Series
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Generating Immersive, Explorable, and Interactive 3D Worlds
Definitions for AI/ML tasks like dataset creation
Operating LLMs in production
An Efficient Web-enhanced Question Answering System
Empowering Code Generation with OSS-Instruct
Tiny vision language model
Data Lake for Deep Learning. Build, manage, and query datasets
Sparsity-aware deep learning inference runtime for CPUs
SimpleMem: Efficient Lifelong Memory for LLM Agents
Eva is an A.I. assistant that helps users multi-task.
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
"VideoRAG: Chat with Your Videos
Minimal Claude Code alternative. Single Python file, zero dependencies