The visual feedback tool for agents
End-to-end speech processing toolkit
Uncommon Objects in 3D dataset
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
Scalable data pre processing and curation toolkit for LLMs
Python binding to the Apache Tika™ REST services
Give your AI agent access to your live Chrome session
Allow LLMs to control a browser with Browserbase and Stagehand
Running large language models on a single GPU
AI Powered Knowledge Graph Generator
A Model Context Protocol server for searching and analyzing arXiv
Adds powerful web scraping and search to Cursor and Claude
Assist in organizing your piles of documents
A lightweight 3D Morphable Face Model library in modern C++
NLP Cloud serves high performance pre-trained or custom models
A distributed system for embedding-based vector retrieval
Give Claude the ability to watch and understand videos
Extract structured data from webpages using LLM-powered scraping
A fast TTS architecture with conditional flow matching
Pokee Deep Research Model Open Source Repo
ktrain is a Python library that makes deep learning AI more accessible
A PyTorch-based Speech Toolkit
Build AI-powered semantic search applications
Chess application whichs allows working with chess PDF books and PGNs.
Geographic feature extraction and data mining