Open speech-to-speech models and pipelines by Hugging Face
CogView4, CogView3-Plus and CogView3 (ECCV 2024)
A Markdown-first memory system, a standalone library for any AI agent
CNCF Sandbox Project
Netease Youdao's open-source embedding and reranker models
Framework for building and orchestrating multi-agent AI systems
Empowering Code Generation with OSS-Instruct
Utilities intended for use with Llama models
Renderer for the harmony response format to be used with gpt-oss
Open Vision Agents by Stream. Build voice and vision agents quickly
Curated list of classic, high-quality computer science books
OpenTinker is an RL-as-a-Service infrastructure for foundation models
Multimodal embedding and reranking models built on Qwen3-VL
Scalable data preprocessing and curation toolkit for LLMs
Foundational model for human-like, expressive TTS
Reference agents, skills, and data for the financial-services
Build a large language model from scratch with only a Python foundation
General-purpose image editing model that delivers high-fidelity
Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI
Security Scanner for Agent Skills
Chinese XLNet pre-trained model
Document Image Parsing via Heterogeneous Anchor Prompting
4M: Massively Multimodal Masked Modeling
A Customizable Image-to-Video Model based on HunyuanVideo
Agent S: an open agentic framework that uses computers like a human