Spring Batch is a framework for writing batch applications using Java
Toolkit for conversational AI
Modest natural-language processing
Industrial-strength Natural Language Processing (NLP)
Library to encode and decode images in WebP format
Docker image used to run data processing workloads
State-of-the-Art Natural Language Processing
HTML Loader
iLovePDF REST API - PHP Library
Open Source Differentiable Computer Vision Library
ExtractThinker is a Document Intelligence library for LLMs
Non-Blocking Reactive Foundation for the JVM
Message Queue and Batch processing for Node.js and Python
CV-CUDA™ is an open-source, GPU-accelerated library
Video editing with Python
The Classical Language Toolkit
Data and tools for generating and inspecting OLMo pre-training data
A curated list of data mining papers about fraud detection
Training data (data labeling, annotation, workflow) for all data types
The lxml XML toolkit for Python
Parser generator to read, process, or translate structured text
Production-ready data processing made easy and shareable
The most accurate natural language detection library for Python
Optax is a gradient processing and optimization library for JAX
Data-Centric Pipelines and Data Versioning