OCR offline image text recognition command line windows program
A python module to repair invalid JSON from LLMs
Fantasy Premier League MCP Server
A high-quality tool for convert PDF to Markdown and JSON
The data structure for multimodal data
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML
Open platform for sharing and discovering Stable Diffusion models
Toloka-Kit is a Python library for working with Toloka API
Export and Share your ChatGPT conversation history
Token-Oriented Object Notation (TOON)
Making ALL Software Agent-Native
Crawl a website starting from a URL, find relevant pages
Distribute and run LLMs with a single file
OCR model for complex documents with layout-aware structured outputs
Command-line tool for Drive, Gmail, Calendar, Sheets, Docs, Chat, etc.
Qwen2.5-VL is the multimodal large language model series
Voice Recognition to Text Tool
Open-source, code-first Python toolkit for building, evaluating, etc.
A Protocol for Agent-Driven Interfaces
AI Browser Agent is an advanced Browser AI tool
Open source visual editor for building React drag-and-drop pages
Focus on creating classic Python small examples and cases
Extract and convert data from any document, images, pdfs, word doc
Making Enterprise Data Intelligent and Responsive for AI
Extract structured data from webpages using LLM-powered scraping