Advanced NLP with spaCy: A free online course
Large Audio Language Model built for natural interactions
StreamSpeech is a seamless model for offline speech recognition
A framework to enable multimodal models to operate a computer
The behavior guidance framework for customer-facing LLM agents
Ready-to-use OCR with 80+ supported languages
Visual Causal Flow
Persian NLP Toolkit
A Web UI for easy subtitle generation using the Whisper model
Get your documents ready for gen AI
The no-nonsense RAG chunking library
Models for the spaCy Natural Language Processing (NLP) library
Powerful Android AI agent with tools, automation, and Linux shell
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Real-time voice interactive digital human
On-premises, OCR-free unstructured data extraction
AI-powered tool for generating, optimizing, and translating subtitles
Build voice-based LLM agents. Modular + open source
Capable of understanding text, audio, vision, video
Convert AI papers to GUI
Industrial-strength Natural Language Processing (NLP)
Integrating LLMs into structured NLP pipelines
AI assistant based on large models that can actively think and plan
Python Audio Analysis Library: Feature Extraction, Classification
A very simple framework for state-of-the-art NLP