Showing 29 open source projects for "text based"

View related business solutions
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1
    IMS Open Corpus Workbench

    IMS Open Corpus Workbench

    Indexing and query tools for very large text corpora

    The IMS Open Corpus Workbench is a collection of tools for managing and querying large text corpora (100 M words and more) with linguistic annotations. Its central component is the flexible and efficient query processor CQP, which can be used interactively in a terminal session, as a backend e.g. from a Perl script, or through the Web-based GUI CQPweb.
    Leader badge
    Downloads: 29 This Week
    Last Update:
    See Project
  • 2
    LWT ◆ Learning With Texts [Official]

    LWT ◆ Learning With Texts [Official]

    A feature-rich web application for language learning through reading

    LWT is a tool for Language Learning, inspired by: - Stephen Krashen's principles in Second Language Acquisition, - Steve Kaufmann's LingQ application, and - ideas from Khatzumoto - published at AJATT - All Japanese All The Time. You define languages you want to learn and import texts you want to use for learning. While listening to the audio (optional), you read the text, save, review and test words or multi-word expressions. In new texts all your previously saved words and expressions...
    Leader badge
    Downloads: 15 This Week
    Last Update:
    See Project
  • 3

    Polish to English translation aid

    Polish to English translation aid. Automatic dictionary presentation.

    ...In this way a growing library of Polish texts with inbuilt translatiion assistance could be produced. To begin with one html document is presented, a historical Polish text from the 19th century. The javascript code is just working and would benefit from expert help. Also help is needed is in building up the dictionary resources that are used to create the annotated text. These are based on other dictionaries and there may be copyright issues.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4

    Syntax Untangler

    Teach your students how to figure out tricky texts in any language.

    Web-based activity that asks the learner to visually mark up a short primary text in any language, in order to improve small-scale reading skills. Students get instant feedback to actions. Instructors use Web-based authoring interface to write and publish their content and questions in any language (Unicode).
    Downloads: 0 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 5
    multipractice

    multipractice

    Tool for practicing languages.

    Practice makes perfect. Panglossa MultiPractice is a tool to help you learn and practice languages. You can create your own courses, import courses created by others, and even export courses to HTML or PDF documents. The original project (for Lazarus) was basically a flashcard app. Now it is more like a platform for creating structured courses with text, images, audio and video content, as well as different types of exercises. Please keep in mind that this project is created and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    jieba

    jieba

    Stuttering Chinese word segmentation

    "Jaba" Chinese word segmentation, do the best Python Chinese word segmentation component. Four word segmentation modes are supported. Precise mode, which tries to cut the sentence most precisely, suitable for text analysis. Full mode, scans all the words that can be formed into words in the sentence, the speed is very fast, but the ambiguity cannot be resolved. The search engine mode, on the basis of the precise mode, divides the long words again to improve the recall rate, which is suitable...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Fresh Memory

    Fresh Memory

    Flashcards application with Spaced Repetition method

    Fresh Memory is an application that helps to learn large amounts of any material with Spaced Repetition method. The most important subject is learning foreign words, but Fresh Memory can be also used to learn anything else. The learning data is stored as flash cards and dictionaries. The flash cards may have several fields, and the user controls what combination of fields to learn. The flashcards can have formatted text and images.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    PangInput

    PangInput

    A simple tool for typing characters in different writing systems.

    PangInput is a simple application to help you in typing characters from different languages in unicode. Three methods are available: 1) a virtual keyboard, mapping specific characters to each key on your keyboard; 2) custom character sets, which you can select by clicking on them; 3) macro sets, allowing input of complex scripts - basically mapping a latin transcription to the actual writing of characters or words.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    subs2srs

    subs2srs

    Convert movies and TV shows to flashcards

    subs2srs allows you to create import files for Anki or other Spaced Repetition Systems (SRS) based on your favorite foreign language movies and TV shows to aid in the language learning process. See http://subs2srs.sourceforge.net/ for more information.
    Leader badge
    Downloads: 39 This Week
    Last Update:
    See Project
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    Access competitive interest rates on your digital assets.

    Generate interest, borrow against your crypto, and trade a range of cryptocurrencies — all in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 10
    HermeneutiX

    HermeneutiX

    Your graphical tool for Syntactic/Semantic Structure Analysis of texts

    HermeneutiX is a tool for diagramming syntactic and semantic structures of complex (not necessarily foreign-language) texts (e.g. bible or other historical excerpts). HermeneutiX is now part of SciToS (the scientific tool set). Starting with version 2.0.0, HermeneutiX can be found on GitHub. Please check out the release summary: https://github.com/scientific-tool-set/scitos/releases For an introduction, check out this video: https://youtu.be/uQjewyG0Ad8 PS: To run a Java...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    Musaheb

    An Arabic collocation extraction tool

    “Musaheb”, an Arabic collocation extraction tool that has been designed and implemented to overcome the limitations of existing collocation extraction tools. “Musaheb” is able to extract n-gram collocations up to 5-gram, in addition to extracting the collocates of the nodes (the word-types we are looking for its collocates) within a window size of zero to 15 words. Moreover, it provides eight collocation statistics to calculate the strength of the collocation, and permits the input of...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    The beginnings of a word-processor coupled with a database. These programs are used by the translation industry to check the accuraccy of a translation by running the associated word pairs through a machine translation program. Will now open the same files it closes with.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Swedish-English Open Dictionary

    Swedish-English Open Dictionary

    Swe-Eng Kindle Dictionary Based on Folkets Lexikon

    This is a transcoded and reformatted version of "The People's Dictionary" (Folkets Lexikon, see http://folkets-lexikon.csc.kth.se for more information). The project contains encoding and formatting scripts along with instructions for converting the dictionary database into a Kindle book. The dictionary is in the Kindle's native dictionary format and integrates with the Smart Lookup feature which allows you to access words definitions without leaving your book page - just press and hold to...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 14
    MySQL English Dictionary

    MySQL English Dictionary

    A Full English - English dictionary in MySQL Format

    A dictionary with 176023 entries. Text was extracted from the files at http://www.mso.anu.edu.au/~ralph/OPTED/ and then parsed and stored in a 16MB MySQL database. The database has three fields : a. word b. wordtype and c. definition. You can use this standalone or as a jquery/ajax/PHP addon for your programs. Acknowledgment of the original content: a. OPTED b. Project Gutenburg c. and the 1913 edition of Webster's Unabridged Dictionary
    Downloads: 10 This Week
    Last Update:
    See Project
  • 15
    LunaIDE is a powerfull IDE for Lua programming language and also have support for XML files. Languages: Spanish, English and Portuguese. Note: Lua is licenced under the MIT License and LunaIDE is licensed under the GNU/GPL v3. The creator of this software do not have any kind of relation with PUC-Rio.
    Downloads: 14 This Week
    Last Update:
    See Project
  • 16
    Japanese Text Analysis Tool

    Japanese Text Analysis Tool

    Generate frequency and readability reports from Japanese texts.

    cb's Japanese Text Analysis Tool allows users to analyze Japanese text files and generate 4 kinds of reports: 1) Word Frequency Report, 2) Kanji Frequency Report, 3) Formula-based Readability Report, 4) User-based Readability Report. Portable and does not require installation.
    Leader badge
    Downloads: 14 This Week
    Last Update:
    See Project
  • 17

    FindAKanji

    a tool to learn about Chinese (and Japanese) characters

    ...The user is encouraged to extend a database with acquired information about Chinese characters and Japanese Words. This information may turn out useful in the integrated text editor, for both reading and writing. So there are multiple different uses for this program: - Learn about Chinese characters. Gain, store and memorize knowledge about specific characters and words. - Input devices for Chinese (and Japanese) characters. For use without much prior knowledge. - Working with Japanese text, with powerful help. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Bulgarian-English Open Dictionary

    Bulgarian-English Open Dictionary

    BG-EN Kindle Dictionary Based on BG Office

    The project contains scripts for converting BG Office Dictionary (see http://bgoffice.sourceforge.net/) database into a Kindle book. The dictionary is in the Kindle's native dictionary format and integrates with the Smart Lookup feature. Please consider making a donation by buying the dictionary on Amazon: http://www.amazon.com/s/ref=series_rw_dp_labf?_encoding=UTF8&field-collection=Open+Source+Bulgarian-English+and+English-Bulgarian+Dictionaries&url=search-alias%3Ddigital-text (Този...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 19
    Whitaker's Words Latin Dictionary

    Whitaker's Words Latin Dictionary

    Latin dictionary and grammar aid: Latin to English, English to Latin.

    Written in the Ada programming language, William Whitaker's Words Latin dictionary provides definitions and grammatical analysis of words found in Latin texts. It can deduce the dictionary form of a word based on the form actually found in a text. It can handle Latin words, phrases, or whole files. The dictionary contains some 39000 entries, as would be counted in an ordinary dictionary. This may generate many hundreds of thousands of 'words' that one can construct over all the declensions and conjugations. A few hundred prefixes and suffixes further enlarge the range. ...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 20
    MangaED is a universal program for manga translators. It consists of text processor, image viewer, dictionary and kanji finder
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    LinqYedict

    LinqYedict

    Translate Chinese to English

    Translate Chinese to English using CEDICT (cantonese dictionary). Demonstrate the speed of C# and Linq. Copy the chinese text from any browser/application to Windows clipboard and see the translation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    Japanese Dictionary

    Japanese-to-English dictionary written in Java.

    The Japanese to English Dictionary is written in java, and so will require that you have the JRE installed prior to its installation & use. It is a hobbyist project of mine intended to address my admittedly narrow needs, and so will fall short of yours. If you have any suggestions for improvements, please let me know! This open source application would not be possible without the Japanese-English dictionary and kanji files assembled by Jim Breen; it would also have taken a great deal...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23

    Words finder

    Project used SQLite as provider and SubSonic as ORM, whaching text ...

    Project use SQLite as provider and SubSonic as ORM, watch text and find word with some word parts, also find some statistic data as frequency and so on.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Downloadable and open source Chinese-Spanish vocabulary inspired by the CEDICT and EDICT dictionaries. It is distributed in a plain Unicode text file that can be easily ported to other formats or used by different applications.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 25
    VocLearner
    A software to help users to learn vocabulary lists (from one language to another) more efficiently. They can create several lists, and learn them. The software saves the lists, but also the knowledge level of the user for each sentence of the lists.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB