NLTK Source
PDFCraft is a free, privacy-focused PDF toolkit
AI-Powered Data Processing: Use LOTUS to process all of your datasets
Instantly generate AI-powered subtitles on your device
A system for agentic LLM-powered data processing and ETL
Open source NLP guide with models, methods, and real use cases
A suite of advanced multi-modal LLMs
TextWorld is a sandbox learning environment for the training
Markdown parser, done right. 100% CommonMark support, extensions
Open Source Speech Language Model
Implementing large models into scenario-based applications
SQL-Driven RAG Engine
AI-assisted storyboard and video generation tool
Framework for building real-time voice and multimodal AI agents
Knowledge Graph Generation from Any Text
The Python library for real-time communication
Run local LLMs like Llama, DeepSeek, Kokoro, etc. inside your browser
Open-source multi-speaker long-form text-to-speech model
Towards Human-Sounding Speech
Automated translation solution for visual novels
A simple, easy-to-use, yet powerful line-oriented text editor.
Web-based tool that converts GitHub repository contents
Semantic search and document parsing tools for the command line
CLI tool for mapping organization network ranges using ASN data
Skills shared by Baoyu for improving daily work efficiency with Claude