Showing 44 open source projects for "linux file parser"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • 1
    pyVideoTrans

    pyVideoTrans

    Translate the video from one language to another and embed dubbing

    pyVideoTrans is an ambitious open-source multimedia processing project that assembles speech recognition, subtitle generation, AI translation, voice synthesis, and video assembly into a unified pipeline for converting videos from one language to another with embedded dubbing and captions. At its core it runs speech-to-text models to transcribe audio tracks, translates the resulting text into a target language using local or cloud-based translation engines, synthesizes new speech to match the...
    Downloads: 16 This Week
    Last Update:
    See Project
  • 2
    go-i18n

    go-i18n

    Translate your Go program into multiple languages

    go-i18n is a Go package and a command that helps you translate Go programs into multiple languages. Supports pluralized strings for all 200+ languages in the Unicode Common Locale Data Repository (CLDR). Code and tests are automatically generated from CLDR data. Supports strings with named variables using text/template syntax. Supports message files of any format (e.g. JSON, TOML, YAML). Use goi18n extract to extract all i18n.Message struct literals in Go source files to a message file for...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    SPPAS

    SPPAS

    SPPAS - the automatic annotation and analyses of speech

    SPPAS is a scientific computer software package written and maintained by Brigitte Bigi of the Laboratoire Parole et Langage, in Aix-en-Provence, France. Available for free, with open source code, there is simply no other package for linguists to simple use in the automatic annotations of speech, the analyses of any kind of annotated data and the conversion of annotated files. SPPAS is able to produce automatically speech annotations from a recorded speech sound and its orthographic...
    Downloads: 16 This Week
    Last Update:
    See Project
  • 4
    Laravel Translation

    Laravel Translation

    Translation management for your Laravel application

    Laravel Translation is a package for Laravel that allows you full control over your translations when using Laravel's localization functionality. The package allows you to manage your translations using either native file based translations but also provides a database driver which is useful in multi-server setups. It exposes a user interface allowing you to update existing and add new translations to your application.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 5
    The goal of this project is to investigate optimal ways to do genre classification for the ten indigenous South African languages. Funded by Dept of Arts and Culture of the SA Government. http://www.trifonius.co.za/projects/genre-classification
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6

    Linguistic Analyzer

    The Linguistic Analyzer is a tool for corpus analysis and comparison

    The Linguistic Analyzer (Almuhalil Alloghawy) is a free tool designed by a team from Al-Imam Muhammad bin Saud islamic university that can be used for corpus analysis and comparison in terms of the several linguistic characteristics, such as frequency lists generation, concordances, collocation extraction, the difference between two words, and keyword identification.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 7
    Leseratte is a Java parser for German written language. Currently, it contains a German lexicon (based on the Wiktionary), inflexion rules, a grammar and a parser. (Semantics component planned.) Usable as a Java library, also provides a graphical UI.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    Ghawwas_V4

    An open source system for Arabic corpora processing

    Ghawwas (previously known as Khawas) is an open source system for Arabic corpora processing. Ghawwas V4.0 provides the following main functions: a. Frequency list for single word and N-Grams b. Concordance c. Collocation (MI, CHI Squared, LL, T-Score, Z Score, Dice, Log Dice, Weirdness Coefficient) d. Lexical patterns search e. Two corpora frequency profile comparison based on MI, CHI, LL, T-Score, Z Score, Dice, Log Dice, Weirdness Coefficient f. Accept Windows and UTF-8 character...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9

    ParsPort

    ParsPort is a parsing tool for the Portuguese language.

    ParsPort is a parsing tool for the Portuguese language. It implements a set of perl scripts and CorpusSearch revision queries that allow to convert a POS-tagged file (CLAWS format) into a parsed file (Penn treebank format). ParsPort requires the installation of CorpusSearch2 and is optimized for UNIX (including macOS) and Linux operative systems. This parsing tool was developed at Centro de Llinguística da Universidade de Lisboa, within the P.S. Post Scriptum project, and is based on the one designed by Beatrice Santorini for the French language. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • 10
    PangInput

    PangInput

    A simple tool for typing characters in different writing systems.

    PangInput is a simple application to help you in typing characters from different languages in unicode. Three methods are available: 1) a virtual keyboard, mapping specific characters to each key on your keyboard; 2) custom character sets, which you can select by clicking on them; 3) macro sets, allowing input of complex scripts - basically mapping a latin transcription to the actual writing of characters or words.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Pootle Django

    Pootle Django

    Online translation tool

    Pootle is written in Python using the Django framework and therefore can be installed on any web server that supports serving WSGI applications. Key localization file formats are supported, including Gettext PO, XLIFF, Java .properties, PHP arrays and many more supported by the Translate Toolkit. Ensures the best quality localizations by automatically detecting common errors made by translators. Integrates user suggestions, terminology, Translation Memory, Machine Translation, and more. A...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Corpus Toolkit

    Corpus Toolkit

    A text management tool for linguistic purposes...

    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    HermeneutiX

    HermeneutiX

    Your graphical tool for Syntactic/Semantic Structure Analysis of texts

    HermeneutiX is a tool for diagramming syntactic and semantic structures of complex (not necessarily foreign-language) texts (e.g. bible or other historical excerpts). HermeneutiX is now part of SciToS (the scientific tool set). Starting with version 2.0.0, HermeneutiX can be found on GitHub. Please check out the release summary: https://github.com/scientific-tool-set/scitos/releases For an introduction, check out this video: https://youtu.be/uQjewyG0Ad8 PS: To run a Java...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 14

    ParSpa

    ParSpa is a parsing tool for the Spanish language.

    ParSpa is a parsing tool for Spanish language. It implements a set of perl scripts and CorpusSearch revision queries that allow to convert a POS-tagged file (CLAWS format) into a parsed file (Penn treebank format). ParSpa requires the installation of CorpusSearch2 and is optimized for UNIX (including macOS) and Linux operative systems. This parsing tool was developed at Centro de Linguística da Universidade de Lisboa, within the P.S. Post Sriptum project, and is based on the one designed by Beatrice Santorini for the French language. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Arramooz Alwaseet Arabic Dictionary
    Arramooz Alwaseet Open Arabic Dictionary for morphological analyze. To be useful for Arabic language processing. This dictionary is derived from the Ayaspell Arabic spell checker.
    Leader badge
    Downloads: 9 This Week
    Last Update:
    See Project
  • 16
    Marcion

    Marcion

    The study environment of ancient languages (Coptic, Greek, Latin)

    Marcion is a software forming a study environment of ancient languages (esp. Coptic, Greek, Latin) and providing many tools and resources (dictionaties, grammars, texts). Although Marcion is focused on to study the gnosticism and early christianity, it is an universal library working with various file formats and allowing to collect, organize and backup texts of any kind. Overview of gnostic sources in Coptic language delivered with Marcion: Nag Hammadi Library; Berlin Codex; Codex...
    Leader badge
    Downloads: 51 This Week
    Last Update:
    See Project
  • 17

    ooPorter

    A Porter stemming or stemmer algorithm coded in ooRexx

    This is an ooRexx line-by-line port from Ansi-C to ooRexx of the stemming routine published by Martin Porter 1980. The original source code from Porter has been commented out and emulated by the corresponding (oo)Rexx code as far as possible. This is not an example of good or fast (oo)Rexx programming, it is merely a demonstration of the Porter stemming routine ported to ooRexx, use and modify as necessary. Input to the program is a textfile with words to be stemmed, one per line....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    RoWordNetLib

    RoWordNetLib

    Java API for the Romanian WordNet

    Java API for the Romanian WordNet. Please note that the actual WordNet for Romanian (the XML file containing the network) is not included, it can be obtained (due to its license restrictions) from: http://ws.racai.ro:9191/repository/browse/romanian-wordnet-30/4611a43efb6811e2a8ad00237df3e3580b6b50d1111c4a6292694bded91d5c14/ **** If you would like a direct download of the API containing the RoWordNet xml file, please download them both from :...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Cross-platform application aimed at helping users to learn vocabulary from any foreign language(s). Add/Edit/Delete vocab words (w/ translation, category, sentence, notes, picture). Review (Quiz) vocabulary words.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Panglossa LexHoard

    Panglossa LexHoard

    A simple vocabulary builder with Unicode support.

    With LexHoard you can build vocabularies, glossaries, dictionaries or other types of wordlists. LexHoard features: -Unicode support (with a built-in Unicode editor) -export for MediaWiki and HTML format -attribution of more than one meaning/translation/relation to each word The program is written completely in native Lazarus/FreePascal. The only non-standard library needed is ZeosLib. It uses SQlite databases, so no special drivers or servers are required - although you can easily...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    This project concerns the development of human language technology resources, based on the approach to share or recycle resources between closely related language. http://gerhard.pro/closely-related-languages/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    mwetoolkit

    THIS PROJECT MIGRATED TO https://gitlab.com/mwetoolkit/mwetoolkit3/

    THIS PROJECT MIGRATED TO https://gitlab.com/mwetoolkit/mwetoolkit3/ The Multiword Expressions toolkit aids in the automatic identification and extraction of multiword units in running text. These include idioms (kick the bucket), noun compounds (cable car), phrasal verbs (take off, give up), etc. Even though it focuses on multiword expresisons, the framework is quite complete and can also be useful in any corpus-based study in computational linguistics. The mwetoolkit can be...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23

    eNTranslator

    To aid translation of satsangs of Paramhamsa Nithyananda

    To aid translation of satsangs of Paramhamsa Nithyananda. Can be used for general purpose by others as well. This translator desktop app uses google translator to translate English text. The auto generated translations are then enriched with human alternation using an easy graphical user interface. Time stamp information may be synched and a subtitle file or a simple textual output may be generated. Additionally it is planned to use google voice tools to also add voice over from these...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    A project that aims to create reusable components (C++ libraries, COM components, and Edit controls) for Phonetic Transliteration of Indian languages, such as Telugu, Tamil, Kannada etc.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25

    Dendrarium

    System do pielęgnacji składnikowych drzew składniowych

    Dendrarium służy do wybierania i weryfikacji składnikowych drzew składniowych generowanych przez parser Świgra. System jest użytkowany w Instytucie Podstaw Informatyki PAN do tworzenia banku drzew składniowych dla języka polskiego Składnica.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB