With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.
You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
Try free now
Cloud tools for web scraping and data extraction
Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.
Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
an application to automatically extract text from comic books.
...It is based on the following 3 major algorithms
- Binarization of color images (Niblak and other methods)
- Connected components
- K-Means clustering
Apache Tesseract is used to perform Optical Character Recognition on the extracted text.
A subsequent version of the application will integrate with translation software in order to provide automated translation of comic book texts and re-inserion of translated texts