cbrTekStraktor

cbrTekStraktor is an application to automatically extract text from the text bubbles or speech balloons present in comic book reader files (CBR). Its prime goal is to perform analysis on the texts of comic books. cbrTekStraktor can however also be used for scanlation or similar purposes.

The application also enables to manually define text areas in CBR files. The application comprises a simple graphical editor for further processing the extracted text.

The text extraction is achieved by a combination of statistical and graphical processing operations. It is based on the following 3 major algorithms
- Binarization of color images (Niblak and other methods)
- Connected components
- K-Means clustering

Apache Tesseract is used to perform Optical Character Recognition on the extracted text.

A subsequent version of the application will integrate with translation software in order to provide automated translation of comic book texts and re-inserion of translated texts

Features

java
image processing
text extraction
comic book
statistics
OCR

Project Samples

Project Activity

See All Activity >

Follow cbrTekStraktor

cbrTekStraktor Web Site

Other Useful Business Software

$300 in Free Credit Towards Top Cloud Services

Build VMs, containers, AI, databases, storage—all in one place.

Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.

Get Started

Rate This Project

User Reviews

Be the first to post a review of cbrTekStraktor!

Additional Project Details

Operating Systems

Linux, Mac, Windows

Intended Audience

End Users/Desktop

User Interface

Java Swing

Programming Language

Java

Related Categories

Java Image Converters, Java OCR Software, Java Image Processing Software

Registered

2017-06-04

Similar Business Software

SmartDraw

SmartDraw makes professional drawings and diagrams accessible to everyone. Non-technical users can quickly create floor plans, while professionals get the precision and scale they require. With industry-leading floor planning tools and an intuitive interface for traditional diagramming like...

See Software
Google Cloud Speech-to-Text

Google Cloud’s Speech API processes more than 1 billion voice minutes per month with close to human levels of understanding for many commonly spoken languages. Powered by the best of Google's AI research and technology, Google Cloud's Speech-to-Text API helps you accurately transcribe speech...

See Software
Square 9

Square 9 removes the frustration of extracting data from documents, forms, and all external sources, so you can harness the full power of your information. Release your team from repetitive tasks while your work flows freely in areas like Accounts Payable, Order Processing, Customer and...

See Software
Nutrient SDK

Nutrient is the comprehensive solution for all your PDF needs, offering tools that effortlessly integrate and operate PDF functionality across any platform. 1. SDK PRODUCTS Integrate robust PDF functionality into iOS, Android, Windows, web (JavaScript), or any cross-platform technology,...

See Software
Picsart Enterprise

AI-Powered Image & Video Editing for Seamless Integration. Enhance your visual content workflows with Picsart Creative APIs, a robust suite of AI-driven tools for developers, product owners, and entrepreneurs. Easily integrate advanced image and video processing capabilities into your...

See Software
Acuity Scheduling

Make online appointment scheduling and management a breeze with Acuity Scheduling. Acuity Scheduling is an easy to use online appointment scheduling software that helps professionals and businesses fill their schedule--minus all the hassle. With Acuity, clients see your real-time availability,...

See Software