AI-Powered Data Processing: Use LOTUS to process all of your datasets
A system for agentic LLM-powered data processing and ETL
Instantly generate AI-powered subtitles on your device
Open source NLP guide with models, methods, and real use cases
A suite of advanced multi-modal LLMs
Skills shared by Baoyu for improving daily work efficiency with Claude
Markdown parser, done right. 100% CommonMark support, extensions
Open Source Speech Language Model
Shared repository for open-sourced projects from the Google AI Lang
Implementing large models into scenario-based applications
SQL-Driven RAG Engine
AI-assisted storyboard and video generation tool
Automated translation solution for visual novels
Open-source multi-speaker long-form text-to-speech model
Towards Human-Sounding Speech
Knowledge Graph Generation from Any Text
The Python library for real-time communication
Run local LLMs like Llama, DeepSeek, Kokoro, etc. inside your browser
Semantic search and document parsing tools for the command line
Framework for building real-time voice and multimodal AI agents
Web-based tool that converts GitHub repository contents
CLI tool for mapping organization network ranges using ASN data
Improve your resumes with Resume Matcher
Fast multimodal LLM for real-time voice interaction and AI apps
Diffusion Transformer with Fine-Grained Chinese Understanding