A python module to repair invalid JSON from LLMs
The data structure for multimodal data
Synthetic Data Generation for tabular, relational and time series data
Toloka-Kit is a Python library for working with Toloka API
A high-quality tool for convert PDF to Markdown and JSON
AlphaFold 3 inference pipeline
Structured data extraction and instruction calling with ML, LLM
Deterministic LLMs Outputs for AI Applications and AI Agents
No-code LLM Platform to launch APIs and ETL Pipelines
Focus on creating classic Python small examples and cases
Voice Recognition to Text Tool
OCR model for complex documents with layout-aware structured outputs
Qwen2.5-VL is the multimodal large language model series
Automate browser-based workflows with LLMs and Computer Vision
Test-Time Reinforcement Learning
Swirl queries any number of data sources with APIs
The ChatGPT Retrieval Plugin lets you easily find personal documents
OCR expert VLM powered by Hunyuan's native multimodal architecture
Did you say you like data?
Run GGUF models easily with a UI or API. One File. Zero Install.
A Unified Toolkit for Deep Learning Based Document Image Analysis
Machine learning tool that allows you to train and test models
An easy to use Neural Search Engine
Jupyter Notebook tutorials for REINVENT 3.2