Showing 25 open source projects for "dictionary"

View related business solutions
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    OpenPromptStudio

    OpenPromptStudio

    Visual editor for AI prompts with translation, categories, and tools

    ...OpenPromptStudio also provides translation capabilities that can convert Chinese prompts into English and display Chinese translations for English prompts, which is especially useful for tools that require English inputs. A built-in prompt dictionary helps users quickly access commonly used prompt fragments and reuse them in different projects. Users can optionally manage and maintain this dictionary using a connected workspace database, enabling more flexible prompt organization.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    LLPlayer

    LLPlayer

    The media player for language learning, with dual subtitles

    LLPlayer is an open-source media player designed specifically for language learning through video content. Unlike traditional media players, the application focuses on advanced subtitle-related features that help learners understand and interact with foreign language media more effectively. The player supports dual subtitles so users can simultaneously view text in both the original language and their native language while watching videos. It can also automatically generate subtitles in real...
    Downloads: 56 This Week
    Last Update:
    See Project
  • 3
    gse

    gse

    Go efficient multilingual NLP and text segmentation

    ...Gse is implements jieba by golang, and try add NLP support and more feature. Support common, search engine, full mode, precise mode and HMM mode multiple word segmentation modes. Support user and embed dictionary, Part-of-speech/POS tagging, analyze segment info, stop and trim words. Support multilingual: English, Chinese, Japanese and others. Support Traditional Chinese. Support HMM cut text use Viterbi algorithm. Support NLP by TensorFlow (in work). Named Entity Recognition (in work). Supports with elastic search and bleve. run JSON RPC service.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 4
    Style-Bert-VITS2

    Style-Bert-VITS2

    Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles

    Style-Bert-VITS2 is a text-to-speech system based on Bert-VITS2 that focuses on highly controllable voice styles and emotional expression. It takes the original Bert-VITS2 v2.1 and its Japanese-Extra variant and extends them so you can control emotion and speaking style with fine-grained intensity, not just choose a generic tone. The project targets both power users and beginners: Windows users without Git or Python can install and run it using bundled .bat scripts, while advanced users can...
    Downloads: 11 This Week
    Last Update:
    See Project
  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • 5
    ClearML

    ClearML

    Streamline your ML workflow

    ClearML is an open source platform that automates and simplifies developing and managing machine learning solutions for thousands of data science teams all over the world. It is designed as an end-to-end MLOps suite allowing you to focus on developing your ML code & automation, while ClearML ensures your work is reproducible and scalable. The ClearML Python Package for integrating ClearML into your existing scripts by adding just two lines of code, and optionally extending your experiments...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    MoCo (Momentum Contrast)

    MoCo (Momentum Contrast)

    Self-supervised visual learning using momentum contrast in PyTorch

    ...It introduces Momentum Contrast (MoCo), a scalable approach to self-supervised learning that enables visual representation learning without labeled data. The core idea of MoCo is to maintain a dynamic dictionary with a momentum-updated encoder, allowing efficient contrastive learning across large batches. The repository includes implementations for both MoCo v1 and MoCo v2, the latter improving training stability and performance through architectural and augmentation enhancements. Training is optimized for distributed multi-GPU environments, using DistributedDataParallel for speed and simplicity.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7

    uweb browser: unlimited power

    minimal suckless android web browser with unlimited power

    ... - Customizable: user-defined menus, (new) buttons and gestures for user agents, bookmarklets, url services, shell commands, internal functionality links and text processing etc. - Convenient: book/dictionary/txt/command line/app can be search engine. - Tiny: less than 200k - Fast: run fast, even with thousands of user provided css/scripts - Efficient: less touches, one click to reach any number of search engines without repeated input; automate online services. - URL bar command line support ("!" and .js files as commands). - user-defined site-specific JS/CSS/HTML/preprocessing...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 8
    find-similar

    find-similar

    User-friendly library to find similar objects

    The mission of the FindSimilar project is to provide a powerful and versatile open source library that empowers developers to efficiently find similar objects and perform comparisons across a variety of data types. Whether dealing with texts, images, audio, or more, our project aims to simplify the process of identifying similarities and enhancing decision-making. https://github.com/findsimilar/find-similar - GitHub repo http://demo.findsimilar.org/ - Demo project and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Synonyms

    Synonyms

    Chinese synonyms, chat robot, intelligent question and answer toolkit

    Chinese Synonyms for natural language processing and understanding. Better Chinese synonyms, chatbot, intelligent question and answer toolkit. synonymsCan be used for many tasks in natural language understanding, text alignment, recommendation algorithms, similarity calculation, semantic shifting, keyword extraction, concept extraction, automatic summarization, search engines, etc. Print synonyms in a friendly way for easy debugging. "Synonyms Cilin" was compiled by Mei Jiaju and others in...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 10
    NLP-progress

    NLP-progress

    Repository to track the progress in Natural Language Processing (NLP)

    Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks. This document aims to track the progress in Natural Language Processing (NLP) and give an overview of the state-of-the-art (SOTA) across the most common NLP tasks and their corresponding datasets. It aims to cover both traditional and core NLP tasks such as dependency parsing and part-of-speech tagging as well as more recent ones such...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    MUSE

    MUSE

    A library for Multilingual Unsupervised or Supervised word Embeddings

    ...By mapping languages into a common vector space, MUSE makes it straightforward to build cross-lingual applications where resources are scarce for some languages. The training and evaluation pipeline is lightweight and fast, so experimenting with different languages or initialization strategies is easy. Beyond dictionary induction, the learned embeddings are often used as building blocks for downstream tasks like classification, retrieval, or machine translation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    kcws

    kcws

    Deep Learning Chinese Word Segment

    ...Pay attention to the public account of waiting for words and reply to kcws to get the corpus download address. Extract the corpus to a directory. Change to the code directory.After installing tensorflow, switch to the kcws code directory. Currently, the custom dictionary is supported in the decoding stage. Please refer to kcws/cc/test_seg.cc for specific usage. The dictionary is in text format.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    auto_ml

    auto_ml

    Automated machine learning for analytics & production

    ...Here's an example that includes serializing and loading the trained model, then getting predictions on single dictionaries, roughly the process you'd likely follow to deploy the trained model. Before you go any further, try running the code. Load up some data (either a DataFrame, or a list of dictionaries, where each dictionary is a row of data). Make a column_descriptions dictionary that tells us which attribute name in each row represents the value we’re trying to predict. Pass all that into auto_ml, and see what happens! You can pass in your own function to perform feature engineering on the data. This will be called as the first step in the pipeline that auto_ml builds out. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    Devanagari OCR

    Devanagari Optical Character Recognition, Annotation tool

    The project has source code and data related to the following tools: 1. Optical Character Recognition. Recognize machine printed Devanagari with or without a dictionary. 2. Document Image Analysis. Automatic page segmentation of document images in multiple Indian languages. Identifies pictures, lines, and words in a document scanned at 300 dpi. 3. Multi-lingual annotation. An interface that has transilteration and a soft-keyboard using which multiple languages can be input. The UI also enables users to view the word and character level ground truth of images. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 15
    Ansj Chinese word segmentation

    Ansj Chinese word segmentation

    Ansj word segmentation

    The real java implementation of ict. The word segmentation effect is faster than the open source version of ict. Chinese word segmentation, name recognition, part-of-speech tagging, user-defined dictionary. This is a java implementation of Chinese word segmentation based on n-Gram+CRF+HMM. The word segmentation speed reaches about 2 million words per second (tested under mac air), and the accuracy rate can reach more than 96%. At present, it has realized the functions of Chinese word segmentation, Chinese name recognition, user-defined dictionary, keyword extraction, automatic summarization, and keyword tagging. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 16

    Drug Extraction

    Drug name extraction

    Drug name recognition and normalisation/grounding to DrugBank ids and standard names. Package provides 2 taggers: 1. DrugTagger - CRF-based with DrugBank presence feature (see feature set for details). 2. DrugnameGazetteer - gazetteer/dictionary-based. Dictionary created from DrugBank.ca database. Both taggers include grounding/normalisation to DrugBank ids and standard names. Feature set: Word, Word-1, Word+1, Word-1_Word, Word_Word+1, DrugBankPresence, POS DrugBankPresence feature indicates the presence of the drug name in the DrugBank. Using CONLL-Evaluation: processed 32065 tokens with 3656 phrases; found: 3251 phrases; correct: 2786. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    VADER

    VADER

    Lexicon and rule-based sentiment analysis tool

    VADER (Valence Aware Dictionary and sEntiment Reasoner) is a lexicon and rule-based sentiment analysis tool designed for analyzing the sentiment of text, particularly in social media and short text formats. It is optimized for quick and accurate analysis of positive, negative, and neutral sentiments.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 18

    Quadriga

    Quadruple Network Management System

    Quadriga is a web-application that acts as a clearing-house for text annotations -- in the form of contextualized triples, or “quadruples,” that form complex graphs -- generated with the Vogon desktop application (https://sourceforge.net/projects/gobtan/), and as an environment for managing text-annotation projects. It relies on a central authentication system for user authentication, a dictionary service (Wordpower) and an authority file service (Conceptpower). Quadriga can connect to a DSpace repository, allowing users to select items stored in the repository for annotation. Quadriga can use standard graphs to map quadruples onto conventional semantic graphs that can be submitted to a triple store, and used for interactive websites and visualizations.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    This is a database of the Arabic roots and their derivatives in voweled and unvoweled forms along with stems. The database is extracted from the well known Arabic legacy dictionary "تاج العروس من جواهر القاموس".
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20

    DictionaryReader

    Research OCR Project

    DictionaryReader is a research project; the aim of which is to create software that can automatically import data from mixed-script dictionaries. (For example, a Tibetan-English dictionary.)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    The Neurpheus Morphological Analyser performs morphological analysis, stemming or word form generation tasks using sophisticated classification methods for an analysis of words unseen in a training dictionary.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Arabic WordNet: A multi-lingual concept dictionary with mappings between word senses in Arabic and those in the Princeton WordNet (English) v2.0. Supports searching and browsing of Arabic and English terms.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 23
    JWNL is a Java API for accessing the WordNet relational dictionary. WordNet is widely used for developing NLP applications, and a Java API such as JWNL will allow developers to more easily use Java for building NLP applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Hollaka Hollala is a chatbot engine and IRC bot. The chatbot engine uses a knowledge base organized in an XML file and WordNet for dictionary lookup. The IRC Bot has some WordNet- and Web-based functionality and is hot-extensible through Ruby programs.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    is a natural language understanding program written in Python. It includes an English language parser, an English dictionary tagged with parts of speech and including phrasal verbs and common first names, a spelling corrector, and an indexed file module
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB