Showing 34 open source projects for "tagging"

  • 1
    AlohaMini

    Open-Source Dual-Arm Mobile Robot with Motorized Lift

    AlohaMini is a lightweight self-hostable productivity and note-taking platform designed to help individuals organize thoughts, tasks, and knowledge in a fast, minimal interface that avoids unnecessary complexity. It combines hierarchical notebooks, rich text editing, and tagging, allowing users to structure information the way they think and refine that structure over time. The platform focuses on speed and responsiveness, with a UI that loads instantly and feels smooth even on modest hardware, making it ideal for both desktop and small server deployments. AlohaMini also supports internal linking between notes, enabling a web-like connected knowledge space that grows organically as users create and connect ideas. ...
    Downloads: 1 This Week
  • 2
    tag-my-dataset

    Program for tagging a dataset

    A graphical user interface program for working with labels and images in a dataset.
    Downloads: 4 This Week
  • 3
    Unitag is a language-independent Unicode-based part-of-speech tagging system. Written entirely in ANSI-compatible C, it should (in theory) compile on any OS, but has been tested on 32-bit Windows.
    Downloads: 0 This Week
  • 4

    MITRE Annotation Toolkit

    A toolkit for managing and manipulating text annotations

    The MITRE Annotation Toolkit (MAT) is a suite of tools which can be used for automated and human tagging of annotations. Annotation is a process, used mostly by researchers in natural language processing, of enhancing documents with information about the various phrase types the documents contain. MAT supports both UI interaction and command-line interaction, and provides various levels of control over the overall annotation process.
    Downloads: 5 This Week
  • 5

    KSUCCA Corpus

    A 50-million-token corpus of Classical Arabic.

    The King Saud University Corpus of Classical Arabic (KSUCCA) is a pioneering 50-million-token annotated corpus of Classical Arabic texts spanning the pre-Islamic era to the fourth Hijri century (equivalent to the period from the seventh to the early eleventh century CE), which is the period of pure Classical Arabic. The corpus is mainly intended for studying the distributional lexical semantics of the words of the Quran. However, it can be used for other research purposes, such...
    Downloads: 4 This Week
  • 6

    RDRPOSTagger

    A Rule-based Part-of-Speech and Morphological Tagging Toolkit

    RDRPOSTagger is a robust, easy-to-use, language-independent rule-based toolkit for part-of-speech (POS) and morphological tagging. It is fast in both learning and tagging, and its accuracy is very competitive with state-of-the-art results. RDRPOSTagger now supports pre-trained POS and morphological tagging models for Bulgarian, Czech, Dutch, English, French, German, Hindi, Italian, Portuguese, Spanish, Swedish, Thai and Vietnamese. ...
    Downloads: 0 This Week
  • 7

    Mobile Tracking Processor

    An easy way to remove noise from mobile telemetry data

    This program takes a .txt input file containing the raw output from a Lotek SRX800 receiver and removes false detections based on a user-provided .csv of tagging data.
    Downloads: 0 This Week
  • 8
    Phrasal

    Statistical phrase-based machine translation system

    ...Our work ranges from basic research in computational linguistics to key applications in human language technology, and covers areas such as sentence understanding, automatic question answering, machine translation, syntactic parsing and tagging, and sentiment analysis.
    Downloads: 0 This Week
  • 9

    VnDP

    A Vietnamese dependency parsing toolkit

    VnDP is a Vietnamese dependency parsing toolkit that integrates a pre-trained parsing model and a pre-trained POS tagging model. The parsing model was trained on our VnDT Vietnamese dependency Treebank, which was automatically converted from the Vietnamese constituent Treebank. For more details, see VnDP's website at http://vndp.sourceforge.net/
    Downloads: 0 This Week
  • 10
    CoreTML framework
    CoreTML framework is an open-source, template-based configuration system that lets developers create parametrized templates by inserting special content into any text file. Its main purpose is to serve as a toolkit for semiconductor IP core creation (based on VHDL/Verilog).
    Downloads: 0 This Week
  • 11
    ICE Nigeria

    Nigerian component of the International Corpus of English

    ...This is the first release of the complete corpus. The corpus can be downloaded in several parts. The written part can be downloaded as text files, XML files, and XML files with part-of-speech tagging, each with or without the raw files. For the spoken part, the EAF files (ELAN files in XML format) together with the text files can be downloaded separately from the sound files. In addition, we provide the corpus manual as well as metadata (speaker age, gender, ethnic group and profession) and XML specifications.
    Downloads: 12 This Week
  • 12
    iTag

    Tag images using various categories and modifiers

    ...iTag allows users to define up to 9 different categories and name them accordingly. In addition, 4 modifiers are available to further increase the options during a tagging session. Users can load a series of images into a session and tag objects in these images using the previously defined categories and modifiers. When the session ends, result files are produced that include the GPS information for each picture (if provided by the EXIF data), the number of objects in each category, and a detailed result file that describes each individual object. ...
    Downloads: 0 This Week
  • 13

    Drug Extraction

    Drug name extraction

    ...accuracy: 95.25%; precision: 85.70%; recall: 76.20%; FB1: 80.67. Using GATE Corpus Benchmark: Strict: P: 0.65, R: 0.73, F1: 0.69; Lenient: P: 0.74, R: 0.84, F1: 0.78. For details on how to reproduce the evaluation, see the README. To use the standalone version for tagging, download DrugExtractionStandalone.tar.gz from Files.
    Downloads: 0 This Week
  • 14

    Aelius Brazilian Portuguese POS-Tagger

    Python, NLTK-based package for shallow parsing of Brazilian Portuguese

    Aelius is an ongoing open source project aiming at developing a suite of Python, NLTK-based modules and interfaces to external freely available tools for shallow parsing of Brazilian Portuguese. It also includes language resources such as language models, sample texts, and gold standards. Presently, Aelius already offers facilities for POS-tagging and chunking corpora and outputting annotations in different formats, such as in XML in the TEI P5 encoding scheme.
    Downloads: 0 This Week
  • 15
    SCAN
    SCAN (Smart Content Aggregation and Navigation) is a universal semantic content aggregator. It combines search, text analysis, tagging, and metadata functions to provide a new user experience for desktop navigation and document management.
    Downloads: 6 This Week
  • 16

    Bermuda Text-to-Speech

    This project includes basic NLP and DSP techniques for Text-to-Speech

    See the TTS demo at: http://rslp.racai.ro/index.php?page=tts This project is written entirely in Java and includes a set of tools and methods designed to enable multilingual text-to-speech (TTS) synthesis. We currently support English and Romanian, but we will soon train more models and make them available for download. If you want to read more about our other NLP and TTS tools, check out http://nlptools.racai.ro.
    Downloads: 1 This Week
  • 17
    Cotovía

    Text-to-Speech System for Galician and Spanish

    Cotovía is a unit-selection text-to-speech system for Galician and Spanish. Cotovía is distributed under the GPL3.0+ license, but each of the available speaker voices has its own license. The speakers available on SourceForge are free for commercial and non-commercial use. Another speaker, free for non-commercial use, is available through external links (see the Blog section). Cotovía has been developed by the University of Vigo and the center 'Ramón Piñeiro' for Research in Humanities,...
    Downloads: 1 This Week
  • 18
    ...Plinko should identify and parse the data completely without the sending system caring what it sent. The latest version supports named fields in the STL files for tagging data parsed in the Prefix Tree, and anonymous functions for parsing dynamic message payloads. We now output JSON objects, and I'm working on HBase integration. Outputting JSON also leaves open the possibility of on-the-fly, in-memory correlation between events. Read the included README before starting; it has a quick start guide and information on the constructors.
    Downloads: 0 This Week
  • 19

    Obeliks

    Obeliks: a morphosyntactic tagger for the Slovene language

    ...The aim of the Obeliks project is to develop the most accurate statistical tagger for the Slovene language. Morphosyntactic tagging is the process of assigning a word in a text to a particular part-of-speech category and describing it with various morphological features related to that category. The source code is on GitHub (see Wiki).
    Downloads: 0 This Week
  • 20
    Basic Note Tagger
    Standalone application that uses the clipboard to insert a tag in any document and/or in a log file. Tags contain context information (e.g., date, author) plus a text note, picture, or webcam shot. Tags can contain variables generated by the user and/or external programs.
    Downloads: 0 This Week
  • 21

    Mansour

    Arabic text analyzer

    Mansour is a simple desktop application for analyzing digital text written in Arabic.
    Downloads: 0 This Week
  • 22
    HanNanum - Korean POS Tagger
    HanNanum is a Korean morphological analyzer and POS tagger. The new Java version adopts a plug-in, component-based architecture for flexible use. You can find the workflow for morphological analysis, POS tagging, noun extraction, etc. Contact: kschoi@kaist.ac.kr hjjeong@world.kaist.ac.kr
    Downloads: 0 This Week
  • 23
    The Rudify tools are a collection of tools for ontology tagging.
    Downloads: 2 This Week
  • 24
    TagSEA is a framework for tagging locations of interest within Eclipse. Visit our project page for more information - including downloads and documentation: tagsea.sourceforge.net
    Downloads: 0 This Week
  • 25
    Maximum entropy is a powerful method for constructing statistical models of classification tasks, such as part of speech tagging in Natural Language Processing. Several example applications using maxent can be found in the OpenNLP Tools Library.
    Downloads: 2 This Week