Extract and convert data from any document, images, pdfs, word doc
Self-hosted AI audio transcription
LLM framework for document understanding and semantic retrieval
Document (PDF, Word, PPTX ...) extraction and parse API
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
An LLM-based presentation generation platform
RAG Web UI is an intelligent dialogue system based on RAG
Encoder of greater-than-word length text trained on a variety of data
Resources, corpora, and tools for Chinese natural language processing
Microsoft Distributed Machine Learning Toolkit
Natural Language Processing (NLP) for the Masses
Course materials for Georgia Tech CS 4650 and 7650
A Java Class Library for Text Processing
Text processing module for JCLAL
JSON based text search Java Project
Consilium – User Defined sentence Suggestion Tool.
Java API and tools for performing NLP and other AI tasks
CRFSharp is a .NET(C#) implementation of Conditional Random Field
CTC-based forced aligner for audio-text in 158 languages