A PyTorch-based Speech Toolkit
Formula recognition based on LaTeX-OCR and ONNXRuntime
Semantic search and workflows for medical/scientific papers
Build voice-based LLM agents. Modular + open source
Jittor is a high-performance deep learning framework
Underthesea - Vietnamese NLP Toolkit
A full spaCy pipeline and models for scientific/biomedical documents
Multi-Scale Fusion of Locally-Global Descriptors for Place Recognition
Label, clean and enrich text datasets with LLMs
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
CLI tool to extract (meta)data from PDF and manipulate PDF files
An open source object detection toolbox based on PyTorch
Obsei is a low code AI powered automation tool
Persian NLP Toolkit
The no-nonsense RAG chunking library
Multilingual Automatic Speech Recognition with word-level timestamps
Training data (data labeling, annotation, workflow) for all data types
2D and 3D Face alignment library build using pytorch
Conversational voice AI agents
The behavior guidance framework for customer-facing LLM agents
Data manipulation and transformation for audio signal processing
A very simple framework for state-of-the-art NLP
Integrating LLMs into structured NLP pipelines
Internationalized highly customizable annotation and evaluation tool
Stanford NLP Python library for many human languages