Search Results for "natural language processing" - Page 42

Showing 1440 open source projects for "natural language processing"

View related business solutions
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end, so you can focus on your app.
    Try Free
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • 1
    Sedna Native XML Database

    Sedna Native XML Database

    Sedna is a native full-featured XML database management system.

    Sedna is a free native XML database which provides a full range of core database services - persistent storage, ACID transactions, security, indices, hot backup. Flexible XML processing facilities include W3C XQuery implementation, tight integration of XQuery with full-text search facilities and a node-level update language.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2

    OpenSchema

    Document Structuring schema for Natural Language Generation

    An Open Source Java implementation of Document Structuring schema (pl. schemata). Schemata structure (order) a set of facts (RDF in this case). They decide what to say in a Natural Language Generation (NLG) system. Introduced by McKeown (1985).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    This project is devoted to the development of natural language processing tools and resources for the Lingala language, which is spoken by tens of millions of people in central Africa.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4

    PRADA

    PRADA : Pipeline for RNA-Sequencing Data Analysis

    Massively parallel sequencing of cDNA reverse transcribed from RNA (RNASeq) provides an accurate estimate of the quantity and composition of mRNAs. To characterize the transcriptome through the analysis of RNA-seq data, we developed PRADA. PRADA focuses on the processing and analysis of gene expression estimates, supervised and unsupervised gene fusion identification, and supervised intragenic deletion identification. PRADA currently supports 7 modules to process and identify...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 5
    OpenNLP provides the organizational structure for coordinating several different projects which approach some aspect of Natural Language Processing. OpenNLP also defines a set of Java interfaces and implements some basic infrastructure for NLP compon
    Leader badge
    Downloads: 26 This Week
    Last Update:
    See Project
  • 6
    ArabicDiacritizer

    ArabicDiacritizer

    An automatic restoration of Arabic diacritic marks

    This is a software of Arabic diacritical marks restoration. It is based mainly on deep architectures using deep neural network. The algorithm generates diacritized text with determined end case. The algorithm is described in detail in: Ilyes Rebai, and Yassine BenAyed 'Text-to-speech synthesis system with Arabic diacritic recognition system', Computer Speech & Language, 2015. We appreciate it very much if you can cite our related work. ************** Installation...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Yes, finally. The author decide to schedule some of his busy time to release the "Chinese Language Formula"- Step By Step. It will take a while, but eventually Chinese Language Processing is no longer a problem.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    This package contains different tools to add NLP capabilities for Lucene 4.x (it has been tested using Lucene version from 4.6.x to 4.8.1). Although it was originally developed for German, it is, mostly, language independent. It allows the user to lemmatize words to be indexed, to weight termy ba their parts of speech (e.g. weighting nouns mor hevaily than pronouns), and to add synonyms taken from GermaNet or a list you provide to the search index and thereby increase recall of lucene.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    TeleScope

    TeleScope

    XML Data Stream Broker/Replicator

    TeleScope is the efficient intensive-load XML data stream broker, replicator and simple event processing platform (SEP) written in C for the Fedora 17-18, Slackware 13-14, Red Hat Enterprise Linux 6 (RHEL-6) Linux distributions. The platform is intended to be operated upon the single number/word values and is not meant to be deployed for full-text XML stream analysis. TeleScope has internal query language with a set of standard logical operators that allows to construct relatively complex query expressions. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Auth0 B2B Essentials: SSO, MFA, and RBAC Built In Icon
    Auth0 B2B Essentials: SSO, MFA, and RBAC Built In

    Unlimited organizations, 3 enterprise SSO connections, role-based access control, and pro MFA included. Dev and prod tenants out of the box.

    Auth0's B2B Essentials plan gives you everything you need to ship secure multi-tenant apps. Unlimited orgs, enterprise SSO, RBAC, audit log streaming, and higher auth and API limits included. Add on M2M tokens, enterprise MFA, or additional SSO connections as you scale.
    Sign Up Free
  • 10

    pyWeb Literate Programming Tool

    Literate Programming in pure Python

    pyWeb is a Literate Programming tool that will work with any markup language and any programming langauge. The idea is to allow you to create great documentation with as constraints or limitations.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    Persica-A new Persian corpus for NLP

    This project presents a new corpus for NEWS text analysis in Persian

    Lack of multi-application text corpus despite of the surging text data is a serious bottleneck in the text mining and natural language processing especially in Persian language. This project presents a new corpus for NEWS articles analysis in Persian called Persica. NEWS analysis includes NEWS classification, topic discovery and classification, category classification and many more procedures. Dealing with NEWS has special requirements and first of all a valid and reliable corpus to perform the experiments on them. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 12
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Caméléon, source code

    Caméléon, source code

    a graphical language for functional programming

    Warning: This version is the source code of Caméléon, new web site will be released with tutorials & documentations. Caméléon Creator is a graphical algorithm editor designed for processing research, prototyping & calibration. With Caméléon, you construct & calibrate data processes with a simple drag & drop mechanism. Caméléon is a graphical language, based on the theory. Caméléon is a try to make data processing research & prototyping simple & addictive.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Aqsis Renderer
    Aqsis is a cross-platform 3D rendering solution adhering to the RenderMan standard defined by Pixar. Features include programmable shading, true displacements, 3D depth-of-field, motion blur, NURBS, subdivision surfaces, CSG, procedural plugins and m
    Downloads: 3 This Week
    Last Update:
    See Project
  • 15
    PrestaShop Purchase Order Module

    PrestaShop Purchase Order Module

    Prestashop Purchase Order Payment module is a Prestashop module that

    Prestashop Purchase Order Payment module is a Prestashop module that allows purchase orders to be accepted directly during the checkout procedure. The customer of the store can able to complete the checkout process based on the other terms made between the store and customer through Purchase Order with its reference no in this order process. Free Module available from www.modulebazaar.com
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    Larch: Data Analysis for X-ray Spectra

    Data Processing and Analysis for X-ray Spectroscopy and More

    Larch is a scientific data processing language that is designed to be easy to use for novices and complete enough for advanced data processing and analysis. Larch provides a wide range of functionality for dealing with arrays of scientific data, and basic tools to make it easy to use and organize complex data. Larch has been primarily developed for dealing with x-ray spectroscopic and scattering data, especially the kind of data collected at modern synchrotrons and x-ray sources. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    DeSR is a multilingual statistical dependency parser. It produces dependency parse trees for natural language sentences using a parsing model learned from annotated corpora.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    cosd_java

    A Tool for Searching and Demonstrating Code Structure

    CoSD is a tool for searching and demonstrating code structure based on graph database. Since some commercial code is involved in the industrial version of CoSD , we just give out a baseline version of it. We also provide a quick demo for this baseline version. To use CoSD, please read the file readme.txt at first.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Cascalog

    Cascalog

    Data processing on Hadoop without the hassle

    Cascalog is a powerful Clojure (and Java) data processing and querying library built atop Hadoop (via Cascading), providing a high-level, Datalog-inspired abstraction for both big data processing and local computation. Cascalog is hosted at Clojars, and some of its dependencies are hosted at Conjars. Both Clo/Con-jars are maven repos that's easy to use with maven or leiningen. The Cascalog website contains more information and links to Various articles and tutorials. The best way to get...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    A java-based parser for parsing/grabbing web sites and other text or XML documents, based on a nondeterministic parser language, creating XML output. Also contains a few utility classes for HTML, CSV and text parsing, and additional character sets.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    AILOKS

    A polylingual dictionary/ontology system

    AILOKS is an acronym for Artificial Intelligence Linguistic Object Knowledge System. This dictionary is aimed to facilitate advanced NLP, translation engines and provide an effective dictionary interface to humans. This dictionary is Polylingual and should prove useful to language students, linguists and translators alike. It is part of a series of interrelated projects.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    EBMSummariserCorpus

    A corpus for evidence-based medicine query-based summarisation

    This corpus has been sourced from the Clinical Inquiries section of the Journal of Family Practice (http://www.jfponline.com/) with kind permission of the publishers to distribute to the research community. For further details and to cite this work, see: D. Mollá, M.E. Santiago-Martinez A. Sarker, C. Paris. A Corpus for Research in Text Processing for Evidence Based Medicine (2016). Language Resources and Evaluation, 50(4):705-727. DOI 10.1007/s10579-015-9327-2 http://link.springer.com/article/10.1007/s10579-015-9327-2 D. Mollá, M.E. Santiago-Martinez (2011). Development of a Corpus for Evidence Based Medicine Summarisation. Proceedings of Australasian Language Technology Workshop, 86-94. http://alta.asn.au/events/alta2011/proceedings/index.html The second paper is available with the corpus.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23

    scdict

    A search engine for non-OCRed scanned dictionaries.

    The utility is a search engine for non-OCRed scanned dictionaries in PDF and DjVu formats. Platform: *nix. This in an alpha; expect evolution and broken backward compatibility. Managing a 1000 page scanned dictionary can be quite handy if you can just type in a word and the computer opens the volume at the right page. SCDICT wants to provide exactly this interface. How should the computer know the right page? We assume that there is no text layer in the dictionary (they are often...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    The BioNLP UIMA Component Repository provides UIMA wrappers for novel and well-known 3rd-party NLP tools used in biomedical text prosessing, such as tokenizers, parsers, named entity taggers, and tools for evaluation.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 25
    Board Game Language
    Board Game Language (BGL, pronounced "bagel") is a natural language syntax programming language for first-time programmers. It uses board games as a metaphor for programming concepts, with the goal of teaching users the foundations of programming.
    Downloads: 0 This Week
    Last Update:
    See Project
Auth0 Logo