ChatGPT extension for scientific research work
An Open Source text-to-speech system built by inverting Whisper
Agentic, Reasoning, and Coding (ARC) foundation models
Persistent context and multi-instance coordination
MOSS‑TTS Family open‑source speech and sound generation model
Structured RAG: ingest, index, query
A specialized Claude Code workspace for creating long-form
Long-form streaming TTS system for multi-speaker dialogue generation
Fully Local Manus AI. No APIs, No $200 monthly bills
Claude Code skill for generating production-quality SVG+PNG technical
Ultimate meta-skill for generating best-in-class Claude Code skills
Large Multimodal Models for Video Understanding and Editing
OCR expert VLM powered by Hunyuan's native multimodal architecture
Generate high-definition story short videos with one click using AI
Your Personal Research Multi-Tool
Machine Learning Pipelines for Kubeflow
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Open Multilingual Multimodal Chat LMs
Guiding Instruction-based Image Editing via Multimodal Large Language
No-code tool for creating a neural search solution in minutes
WaveRNN Vocoder + TTS
A PyTorch implementation of "Capsule Graph Neural Network"
Code base for the precision, recall, density, and coverage metrics
Code for "Image Generation from Scene Graphs", Johnson et al, CVPR 201
Evaluating state of the art in AI