Showing 13 open source projects for "scripts"

  • 1
    TXM

    Unicode XML TEI text analysis platform

    TXM is a free and open-source, cross-platform, Unicode- and XML-based text analysis environment and graphical client, supporting Windows, Linux and Mac OS X. It can also be used online as a J2EE-compliant web portal (GWT-based) with built-in access control. The latest version of TXM can be downloaded from http://textometrie.ens-lyon.fr/spip.php?rubrique61&lang=en. TXM offers a comprehensive range of analysis tools (concordances, collocate search, frequency lists, etc.) based on the powerful CQP...
    Downloads: 19 This Week
  • 2
    UnsupervisedMT

    Phrase-Based & Neural Unsupervised Machine Translation

    ...Training relies on denoising auto-encoding and back-translation, with on-the-fly, multithreaded generation of synthetic parallel data to continually refresh supervision signals. The project also provides scripts to fetch and preprocess monolingual data, learn BPE codes, and train cross-lingual embeddings that bootstrap unsupervised alignment between languages. Beyond the core EMNLP 2018 setup, the codebase exposes additional, optional capabilities such as multi-language training, language model pretraining with shared parameters, and adversarial training.
    Downloads: 3 This Week
  • 3

    ParsPort

    ParsPort is a parsing tool for the Portuguese language.

    ParsPort is a parsing tool for the Portuguese language. It implements a set of Perl scripts and CorpusSearch revision queries that convert a POS-tagged file (CLAWS format) into a parsed file (Penn Treebank format). ParsPort requires the installation of CorpusSearch2 and is optimized for UNIX (including macOS) and Linux operating systems. This parsing tool was developed at Centro de Linguística da Universidade de Lisboa, within the P.S.
    Downloads: 0 This Week
  • 4
    PangInput

    A simple tool for typing characters in different writing systems.

    ...Three methods are available: 1) a virtual keyboard, which maps a specific character to each key on your keyboard; 2) custom character sets, from which you select characters by clicking on them; 3) macro sets, which allow input of complex scripts by mapping a Latin transcription to the actual written characters or words.
    Downloads: 0 This Week
  • 5

    ParSpa

    ParSpa is a parsing tool for the Spanish language.

    ParSpa is a parsing tool for the Spanish language. It implements a set of Perl scripts and CorpusSearch revision queries that convert a POS-tagged file (CLAWS format) into a parsed file (Penn Treebank format). ParSpa requires the installation of CorpusSearch2 and is optimized for UNIX (including macOS) and Linux operating systems. This parsing tool was developed at Centro de Linguística da Universidade de Lisboa, within the P.S.
    Downloads: 0 This Week
  • 6
    Linux Guist - Multi Lingual OS for Asia

    A Single Click Language Changer and Publishing System for Web and DTP

    Linux Guist is a multilingual Live CD OS for most Asian languages. It can run off a CD on old hardware with just 128 MB of memory, for DTP, web publishing and data entry purposes. This will help IT employers take up government projects that require data collection, entry and publishing at a very low cost, while providing training and job opportunities to numerous students of these languages in the various towns of the country. Talk to your respective IT/HRD ministry to identify...
    Downloads: 0 This Week
  • 7

    t2t-pipe

    automatic alignment pipeline for parallel treebanks

    The Tree-to-Tree (t2t) Alignment Pipe is a collection of Python scripts that coordinate the automatic alignment of parallel treebanks from plain text files with a single call from a Unix command line. Supported languages: DE, FR, EN
    Downloads: 0 This Week
  • 8
    Various tools for creating annotated parallel corpora, including pre-trained tagging and parsing models for various languages, sentence alignment tools and word alignment tools. Uplug also includes a web-based interface for interactive sentence and word alignment, and scripts for indexing and querying parallel corpora using the Corpus Workbench (CWB). Download 'uplug-main' first and then add other packages.
    Downloads: 0 This Week
  • 9
    AzConvert is an open source program that converts between the different scripts of the Azerbaijani language (Latin, Arabic and Cyrillic). It is written with Qt.
    Downloads: 2 This Week
  • 10
    Australian National Corpus

    An ongoing project to collate and provide access to language data

    Includes: scripts for the program/code developed; high-level architecture diagrams; install guides for developers; links to end-user documentation on the AusNC website. Note: The BSD license applies to customised plug-ins, scripts and ingest programs developed by the AusNC project team. Additional open source, third-party software products used by the AusNC solution are referenced on our SF wiki space.
    Downloads: 1 This Week
  • 11
    Kurdi Nus
    A portable, platform-independent, open source tool for converting different Kurdish scripts
    Downloads: 0 This Week
  • 12
    Varamozhi is a free English-Malayalam transliteration library. It can transliterate text between Malayalam and English scripts. Varamozhi takes as input the mapping between a Malayalam font and a transliteration scheme; outputs functions i
    Downloads: 109 This Week
  • 13
    A public repository of open source scripts and small programs related to linguistics and language.
    Downloads: 0 This Week