Showing 142 open source projects for "language processing"

  • 1
    q pipeline manager

    q: integrated platform for pipeline configuration and management

    The q utility is a platform for creating and managing data analysis pipelines. It expands the value of your existing job scheduler - either Grid Engine or TORQUE PBS - through numerous functions that help you organize, submit, monitor, manage and share your informatics work. Data processing pipelines require high-level organization and parallelization of work to optimize resource utilization and decrease the time to results. q (from queue) allows complex job sequences to be efficiently...
    Downloads: 0 This Week
  • 2
    gannu

    Java API and tools for performing NLP and other AI tasks

    Java API and tools for performing a wide range of AI tasks such as: word sense disambiguation (released), optimization (5 Evolutionary Algorithms Implemented ETA February 2014), opinion mining (ETA November 2014) and text wikification (ETA July 2014). Gannu includes some graphical interfaces for scientific purposes. When using Gannu please cite: *Jiménez, F. V., Gelbukh, A. F. & Sidorov, G. (2013). Simple Window Selection Strategies for the Simplified Lesk Algorithm for Word Sense...
    Downloads: 0 This Week
  • 3
    Knowtator is a general-purpose text annotation tool that is integrated with the Protégé knowledge representation system. Knowtator facilitates the manual creation of training and evaluation corpora for a variety of biomedical language processing tasks.
    Downloads: 1 This Week
  • 4

    Hermes Natural Language Processing

    A repository of software, documentation and data for NLP

    Hermes is a repository of software, documentation and data for NLP. I am currently adding corpora extracted from Wikipedia (mostly in Romance languages).
    Downloads: 0 This Week
  • 5
    The goal of this project is the semantic annotation of Czech texts using NLP tools.
    Downloads: 0 This Week
  • 6
    IDL-specified API for manipulating and processing CellML 1.0 and 1.1. Includes a C++ implementation. Accessible from a C++ program, or from any language for which a CORBA language mapping is available. A Java wrapper of the API is also available.
    Downloads: 0 This Week
  • 7
    LuaCV

    OpenCV library wrapper for the Lua language

    LuaCV is an OpenCV library wrapper for the Lua language. Its main goal is to make a fast image processing library accessible from a lightweight, dynamically typed language. Thanks to CMake, the project is much easier to compile on the many OS platforms supported by OpenCV. LuaCV is being developed at the Faculty of Electrical Engineering and Communication of Brno University of Technology in the Czech Republic.
    Downloads: 0 This Week
  • 8

    fem2d

    2D Finite Element Method Tools

    Collection of programs developed to perform various engineering analyses on structures using the finite element technique.
    Downloads: 0 This Week
  • 9

    miac-p

    Code for syntactic parsing and other NLP apps.

    Code for syntactic parsing and other natural language processing applications.
    Downloads: 0 This Week
  • 10
    Transliterator between files in any language: map fonts, create encoding schemes, and type with a Phonetic, Indian, Roman, Tamil, Hindi, English, French, German, Spanish or any other world-language keyboard. Example: [Phonetic Input] to [Any World Language Output], or vice versa.
    Downloads: 0 This Week
  • 11
    Redundancy due to cut-paste operations in text creates bias in machine learning for NLP. This module takes a directory and produces a subset of the files in that directory (in a list) with an upper bound on similarity between two files.
    Downloads: 0 This Week
  • 12
    CRFSharp

    CRFSharp is a .NET (C#) implementation of Conditional Random Fields

    CRFSharp (aka CRF#) is a .NET (C#) implementation of Conditional Random Fields, a machine learning algorithm for learning from labeled sequences of examples. It is widely used in Natural Language Processing (NLP) tasks, for example: word breaking, POS tagging, named entity recognition, query chunking and so on. CRF#'s main algorithm is the same as that of CRF++, written by Taku Kudo. It encodes model parameters by L-BFGS. Moreover, it has many significant improvements over CRF++, such as totally...
    Downloads: 0 This Week
  • 13

    jaf_Utils

    A C++ library for Statistical Language Processing tasks.
    Downloads: 0 This Week
  • 14
    This is a Java-based project for complex event extraction from text and co-reference resolution. Currently the code can read BioNLP shared task format (http://2011.bionlp-st.org/) and i2b2 Natural Language Processing for Clinical Data shared task format (https://www.i2b2.org/NLP/DataSets/Main.php). Event extraction includes finding events and the parameters for an event in a text. The method is based on SVM but other ML algorithms can be adopted. The method details are explained in the following paper: Ehsan Emadzadeh, Azadeh Nikfarjam, and Graciela Gonzalez. 2011. ...
    Downloads: 0 This Week
  • 15
    A toolkit that uses suffix array indexing for empirical natural language processing. It provides functions such as searching for occurrences of n-grams in a corpus, and a suffix array language model that can use an arbitrarily long history.
    Downloads: 0 This Week
  • 16
    The Wikipedia Miner toolkit provides simplified access to Wikipedia. This open encyclopedia represents a vast, constantly evolving multilingual database of concepts and semantic relations, a promising resource for NLP and related research.
    Downloads: 3 This Week
  • 17
    A system to perform analysis of large documents for the purpose of cataloging similar documents. Similarity is based upon contextual analysis of these documents done by identifying common words and proper nouns.
    Downloads: 0 This Week
  • 18
    Web application for making user-friendly requests on a large XML database. Includes tools to XML-ize large bodies of semi-formal texts (like floras) and supports computer-assisted specimen identification. Uses natural language processing and 2D/3D image analysis and generation.
    Downloads: 0 This Week
  • 19
    ELIA (Eyegaze Language Integration Analysis) supports the analysis of eye-tracking data for studies in language processing. ELIA eases early analysis of data to enable iterative development of experiments in response to spoken language.
    Downloads: 0 This Week
  • 20
    TextMarker
    TextMarker is now developed and hosted at Apache UIMA (http://uima.apache.org/textmarker.html). TextMarker is a UIMA-based tool for information extraction and more. The full featured editor of the rule language and the build process of UIMA descriptors are complemented with components for visualization, explanation, testing and rule learning.
    Downloads: 0 This Week
  • 21
    TIDAL is a collection of data acquisition and processing tools for research in respiratory physiology. It provides a high-level language for specifying experimental protocols and required computations.
    Downloads: 0 This Week
  • 22
    Sylli
    Sylli is a universal syllabifier. Developed for Italian, it can easily be adapted to any language that is claimed to respect the SSP. Sylli divides timit, strings, files and directories into syllables.
    Downloads: 0 This Week
  • 23
    Maximum entropy is a powerful method for constructing statistical models of classification tasks, such as part of speech tagging in Natural Language Processing. Several example applications using maxent can be found in the OpenNLP Tools Library.
    Downloads: 0 This Week
  • 24
    A language modeling toolkit written in Java for natural language processing applications. It can handle character-by-character modeling of unknown words, language model combination, comparison, and evaluation, as well as a number of smoothing techniques.
    Downloads: 0 This Week
  • 25
    The Scheme Natural Language Toolkit (S-NLTK) is a Scheme R6RS library for language and text processing, and various tasks related to symbolic and statistical analysis of language data.
    Downloads: 0 This Week