Showing 105 open source projects for "python text parser"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Level Up Your Cyber Defense with External Threat Management Icon
    Level Up Your Cyber Defense with External Threat Management

    See every risk before it hits. From exposed data to dark web chatter. All in one unified view.

    Move beyond alerts. Gain full visibility, context, and control over your external attack surface to stay ahead of every threat.
    Try for Free
  • 1

    Safe Harbor Deidentification

    Safe Harbor Deidentification for medical documents

    Phalanx - Deidentify Safe Harbor Deidentification Mode of Phalanx is an abridged pipeline of NLP annotators culminating in NER annotators which write output of text offsets. It uses the Safe Harbor deidentification method.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    CMD Plot Tool

    CMD Plot Tool

    Calculates and plots Colour Magnitude Diagrams from Astronomical data

    CMD Plot Tool calculates and plots Colour Magnitude Diagrams (CMDs) from astronomical photometric data, e.g. of a star cluster observed in two filter bandpasses. It can handle multiple file formats (plain text, DAOPHOT .mag files, ACS Survey of Galactic Globular Clusters .zpt files) to generate professional and customised plots, without the usual steep learning curve. Development using Python, OOP and a formal software development lifecycle model (Agile) allowed for the creation of an application that can be deployed on multiple systems. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 3
    Xplico

    Xplico

    Xplico is a Network Forensic Analysis Tool (NFAT)

    Xplico is a Network Forensic Analysis Tool (NFAT). The goal of Xplico is extract from an internet traffic capture the applications data contained. For example, from a pcap file Xplico extracts each email (POP, IMAP, and SMTP protocols), all HTTP contents, each VoIP call (SIP, MGCP, MEGACO, RTP), IRC, WhatsApp... Xplico is able to classify more than 140 (application) protocols. Xplico cam be used as sniffer-decoder if used in "live mode" or in conjunction with netsniff-ng. Xplico is used...
    Downloads: 51 This Week
    Last Update:
    See Project
  • 4

    Arabic Corpus

    Text categorization, arabic language processing, language modeling

    The Arabic Corpus {compiled by Dr. Mourad Abbas ( http://sites.google.com/site/mouradabbas9/corpora ) The corpus Khaleej-2004 contains 5690 documents. It is divided to 4 topics (categories). The corpus Watan-2004 contains 20291 documents organized in 6 topics (categories). Researchers who use these two corpora would mention the two main references: (1) For Watan-2004 corpus ---------------------- M. Abbas, K. Smaili, D. Berkani, (2011) Evaluation of Topic Identification Methods on...
    Downloads: 10 This Week
    Last Update:
    See Project
  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    Build gen AI apps with an all-in-one modern database: MongoDB Atlas

    MongoDB Atlas provides built-in vector search and a flexible document model so developers can build, scale, and run gen AI apps without stitching together multiple databases. From LLM integration to semantic search, Atlas simplifies your AI architecture—and it’s free to get started.
    Start Free
  • 5

    TimingDrawer

    Text based timing diagram generator

    This tool generates timing diagrams for documenting hardware design. It reads the description from a text file with a simple syntax. It generates vector graphic (EPS, SVG or EMF format). It can be used in command line mode or with a GUI. It is written in Python and works on any platform.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6

    Presage

    the intelligent predictive text entry platform

    Presage (formerly Soothsayer) is an intelligent predictive text entry system. Presage generates predictions by modelling natural language as a combination of redundant information sources. Presage computes probabilities for words which are most likely to be entered next by merging predictions generated by the different predictive algorithms. Presage's modular and extensible architecture allows its language model to be extended and customized to utilize statistical, syntactic, and semantic...
    Leader badge
    Downloads: 323 This Week
    Last Update:
    See Project
  • 7

    dadosSemiotica

    Collecter and manager of semiotica annalisis data

    This program is a web application to collect and organize data of text analysis. It works with sets of texts and the analysis are done on portions of the length of a sentence. One of the preprocessing modules is based on CoGroo (A LibreOffice & OpenOffice.org Portuguese Grammar Checker).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    aioulinux

    aioulinux

    Linux for Arduino and Makers developers

    Hello, I'm the Aioulinux founder, eager to professionally revive the project. Since 2018, the demand for an IoT and Arduino-tailored environment has been evident. Seeking partners for a 2024 version targeting schools and IoT companies, aiming for a secure and comprehensive platform. If you share this vision and wish to collaborate, reach out. Let's revive Aioulinux stronger than ever! Now seeking partners: Live Distro Specialist: Expert in live distributions to ensure...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    TEACUP

    TEACUP

    TCP Experiment Automation Controlled Using Python

    TEACUP automates many aspects of running TCP performance experiments in a specially-constructed physical testbed. TEACUP enables repeatable testing of different TCP algorithms over a range of emulated network path conditions, bottleneck rate limits and bottleneck queuing disciplines. TEACUP utilises a text-based configuration file to define experiments as combinations of parameters specifying desired network path and end host conditions. When multiple values are provided (e.g. for TCP...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Cloud-based help desk software with ServoDesk Icon
    Cloud-based help desk software with ServoDesk

    Full access to Enterprise features. No credit card required.

    What if You Could Automate 90% of Your Repetitive Tasks in Under 30 Days? At ServoDesk, we help businesses like yours automate operations with AI, allowing you to cut service times in half and increase productivity by 25% - without hiring more staff.
    Try ServoDesk for free
  • 10

    TEES

    Turku Event Extraction System

    Turku Event Extraction System (TEES) is a free and open source natural language processing system developed for the extraction of events and relations from biomedical text. It is written mostly in Python, and should work in generic Unix/Linux environments. Currently, the TEES source code repository still remains on GitHub at http://jbjorne.github.com/TEES/ where there is also a wiki with more information.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    CorNetMap

    A tool for Gene Expression Correlation Network

    Capabilities of CorNetMap: 1. Read data as tab-delimited text file. Can be used for analysis of any data set beyond gene expression. 2. Capable of both two-dimensional and multidimensional data analysis. 3. Calculate Pearson correlation and cross-correlation for analysis data with phase difference. 4. Generate correlation Heat-map and draws network map. 5. Save correlation data as text file. How to use and doccumentation:...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    BioC

    We describe a simple XML format to share text documents and annotation

    A minimalist approach to share text documents and data annotations. Allows a large number of different annotations to be represented. Project files contain: - simple code to hold/read/write data and perform sample processing. - BioC-formatted corpora - BioC tools that work with BioC corpora BioC goals - simplicity - interoperability - broad use - reuse There should be little investment required to learn to use a format or a software module to process that format. We are...
    Leader badge
    Downloads: 20 This Week
    Last Update:
    See Project
  • 13
    Buildes

    Buildes

    A designer’s program for describing parts of the building

    Buildes is an integrated development environment that assists the user in creating building information. It reads a text (session) file from which it compiles the information. It then allows the user to browse, analyze and export the resulting building knowledge. The compilation system is written in pure Python. The building components created are rendered in PythonOCC. The GUI is written using PyQt.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    orbkit (Moved to Github)

    orbkit (Moved to Github)

    A Modular Python Toolbox for Cross-Platform Post-Processing of Quantum

    PLEASE NOTE ORBKIT HAS BEEN MOVED TO https://github.com/orbkit/orbkit orbkit is a parallel Python program package for post-processing wave function data extracted from output files of MOLPRO (Molden File Format), TURBOMOLE (AOMix file format), GAMESS-US, PROAIMS/AIMPAC (wfn/wfx file format), and Gaussian (Output File and Formatted Checkpoint File) output files. Futhermore, an interface to cclib, a parser for quantum chemical logfiles, is provided.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Part-of-speech tagging is the task of assigning symbols from a particular set to words in a natural language text. ACOPOST implements and extends well-known machine learning techniques and provides a uniform environment for testing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    SciEnPlot

    SciEnPlot

    Data Plotting and Analysis for Science and Engineering

    ... - Implemented a 3d surface plot (GLSurface) based on OpenGL (ScienPlot v1.3.2 and above) - ColorMap surface, trisurface, Pie, Polar plots, and 3D height field, 3dBar, scatter plots (under developing), and more - Column by column plotting/calculation - LaTex commands enclosed by $ symbols can be used for the labels in Graph - Accept txt(Text) and csv(Comma separated values) formatted data - Save, copy, print Graph - Use spread sheets to display data - Textboard to organize the results - Graphs in a publishable quality - Source codes based on: Python Numpy Scipy Matplotlib WxPython Visvis etc. - Special functions - Drag and drop data files - Python console is back (since v1.3.3), capable of reusing column data - Debye and Guinier models for SANS / SAX data - More apps in our Web below
    Leader badge
    Downloads: 3 This Week
    Last Update:
    See Project
  • 17

    mwetoolkit

    THIS PROJECT MIGRATED TO https://gitlab.com/mwetoolkit/mwetoolkit3/

    THIS PROJECT MIGRATED TO https://gitlab.com/mwetoolkit/mwetoolkit3/ The Multiword Expressions toolkit aids in the automatic identification and extraction of multiword units in running text. These include idioms (kick the bucket), noun compounds (cable car), phrasal verbs (take off, give up), etc. Even though it focuses on multiword expresisons, the framework is quite complete and can also be useful in any corpus-based study in computational linguistics. The mwetoolkit can be applied to virtually any text collection, language, and MWE type. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    AsiEs stands for Asistente de Escritura (writing assistant). It provides word prediction and autocomplete for fast writing. Thought for people with difficulties writing on keyboard, improves the writing speed preventing the user from pressing at most 50% of keys to write and avoids ortographic errors. Made by Fundación Teletón Uruguay (http://www.teleton.org.uy/home/)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    DEAPathways

    Differential Expression Analysis for Pathways

    This project contains the source code associated with the PLoS Computational Biology publication: "Differential Expression Analysis for Pathways". The paper text can be found here: http://www.ploscompbiol.org/article/info%3Adoi%2F10.1371%2Fjournal.pcbi.1002967
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20

    iMir

    Integrated pipeline for HT miRNA-Seq data analysis

    Processing of smallRNA-Seq data to gather biologically relevant information requires application of multiple statistical and bioinformatics tools from different sources, each focusing on a specific step of the analysis pipeline. The analytical workflow can be challenging for the continuous interventions by the operator, a critical factor when large numbers of datasets need to be analyzed at once. To allow a flexible and comprehensive analysis of smallRNA-Seq data we designed a novel modular...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Mishkal: Arabic Text Vocalization

    Mishkal: Arabic Text Vocalization

    Arabic Text Vocalization system

    Automatic system of vocalization of arabic text.
    Downloads: 182 This Week
    Last Update:
    See Project
  • 22
    SAMoCAD

    SAMoCAD

    Simple CAD program

    SAMoCAD - its program to create simple drawings, partially supports DXF. Main features: - create simple objects (line, arc, circle) - create complex objects (text, dimensions) - edit drawn objects - save drawing to SVG format - output drawing in PostScript format - export/import content DXF files (primitives LINE, CIRCLE, ARC, TEXT, DIMENSION) Program written in Python 2 and use the library Tkinter. Program requires no installation, but you need install Python 2.7 or later on your PC. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23

    quichem

    Utilities for fast input of chemical formulae and equations.

    *** New releases of quichem will only be available on GitHub. *** quichem is a utility written in pure Python allowing for fast input and parsing of chemical formulae and equations. Entering data does not require any modifier keys or keystrokes to be pressed. This SourceForge page provided pre-built Windows binaries of quichem-pyside. **To run quichem on Mac OS X or Linux**, or with the wxPython front-end, please download the source from quichem's GitHub page at: github.com/spamalot/quichem.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Russian morphology tagger. Parses text(s) and output xml representation of text(s) with grammatical annotation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25

    t2t-pipe

    automatic alignment pipeline for parallel treebanks

    The *Tree-to-Tree (t2t) Alignment Pipe* is a collection of python scripts, co-ordinating the process of automatic alignment of parallel treebanks from plain text files with a single call from a unix command line. Supported Languages: DE, FR, EN
    Downloads: 0 This Week
    Last Update:
    See Project