GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
The official CLI to interact with Kaggle
26m function call model that runs on incredibly small devices
GPT Image 2 prompt gallery, image prompt library, agentic skill
All-in-one native macOS AI chat application
Self-evolving autonomous agent framework
Fast State-of-the-Art Static Embeddings
Workplace AI platform for enterprise search and workflow automation
The 100 line AI agent that solves GitHub issues
AI assistant for ComfyUI workflow generation, debugging, and tuning
Sample applications for Google Kubernetes Engine (GKE)
Multilingual Document Layout Parsing in a Single Vision-Language Model
Codes/Notebooks for AI Projects
A general fine-tuning kit geared toward image/video/audio diffusion
Pluggable SOTA multi-object tracking modules for segmentation
An efficient forwarding service designed for LLMs
The official implementation of RAPTOR
Pre & Post-training & Dataset & Evaluation & Depoly & RAG
How to optimize some algorithm in cuda
Web interface for searching and downloading books and audiobooks
Building an Intelligent Agent from Scratch
All-in-one AI framework & toolkit for Claude Code & Cursor
Document Index for Vectorless, Reasoning-based RAG
Harmonized and Coherent Human Image Animation
Run LLM prompts from your shell