An Open Source text-to-speech system built by inverting Whisper
RGBD video generation model conditioned on camera input
Full stack AI software engineer
Agent Skill for generating 2D sprite sheets and map, transparent PNG
Open multimodal web agent built by Ai2
An MCP server for interacting with Google Colab
SQL-native memory layer enabling persistent context for AI agents
Multi-tool for semantic search
High-Performance Face Recognition Library on PaddlePaddle & PyTorch
Running large language models on a single GPU
LongBench v2 and LongBench (ACL 25'&24')
An LLM Compiler for Parallel Function Calling
Scalable RL solution for advanced reasoning of language models
Skywork-R1V is an advanced multimodal AI model series
Build a large language model from 0 only with Python foundation
Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI
A simple yet powerful agent framework for personal assistants
Our first fully AI generated deep learning system
The absolute trainer to light up AI agents
Large Audio Language Model built for natural interactions
95% token savings. 155x faster queries. 16 languages
Framework for building neural networks
Fast and Universal 3D reconstruction model for versatile tasks
MobileLLM Optimizing Sub-billion Parameter Language Models
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI