An enhanced version of the standard Unix strings(1) program that uses language models to automatically identify the language and character set of the text it extracts. It supports over 1400 languages, dozens of character encodings, and 4800+ language/encoding pairs.

Features

  • text extraction
  • language identification
  • character-set identification
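
For orientation, the text-extraction step of a classic strings(1)-style tool can be sketched as below. This is a minimal illustration, not this project's implementation: the function name extract_ascii_strings and its min_len parameter are hypothetical, it handles printable ASCII only, and it performs no language or character-set identification.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical helper (not taken from the project): collect every run of
// printable ASCII bytes that is at least min_len bytes long, which is the
// basic behavior of a traditional strings(1)-style scan.
std::vector<std::string> extract_ascii_strings(const std::string& data,
                                               std::size_t min_len = 4) {
    std::vector<std::string> found;
    std::string run;
    for (unsigned char byte : data) {
        // Treat space through '~' and tab as printable; anything else ends a run.
        const bool printable = (byte >= 0x20 && byte <= 0x7E) || byte == '\t';
        if (printable) {
            run.push_back(static_cast<char>(byte));
        } else {
            if (run.size() >= min_len) found.push_back(run);
            run.clear();
        }
    }
    // Flush a run that reaches the end of the input.
    if (run.size() >= min_len) found.push_back(run);
    return found;
}
```

A language-aware extractor would presumably go further, scoring candidate byte runs against per-language, per-encoding models rather than relying on an ASCII test alone; that part is not shown here.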

Categories

Search

License

GNU General Public License version 3.0 (GPLv3), Creative Commons Attribution Non-Commercial License V2.0

Language-Aware String Extractor Web Site

User Reviews

  • Thanks for software and updates.

Additional Project Details

Operating Systems

Linux, BSD

Languages

English

Intended Audience

Advanced End Users, Developers

User Interface

Command-line

Programming Language

C++

Registered

2011-07-22