Showing 105 open source projects for "python text parser"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Keep company data safe with Chrome Enterprise Icon
    Keep company data safe with Chrome Enterprise

    Protect your business with AI policies and data loss prevention in the browser

    Make AI work your way with Chrome Enterprise. Block unapproved sites and set custom data controls that align with your company's policies.
    Download Chrome
  • 1

    Safe Harbor Deidentification

    Safe Harbor Deidentification for medical documents

    Phalanx - Deidentify Safe Harbor Deidentification Mode of Phalanx is an abridged pipeline of NLP annotators culminating in NER annotators which write output of text offsets. It uses the Safe Harbor deidentification method.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Xplico

    Xplico

    Xplico is a Network Forensic Analysis Tool (NFAT)

    Xplico is a Network Forensic Analysis Tool (NFAT). The goal of Xplico is extract from an internet traffic capture the applications data contained. For example, from a pcap file Xplico extracts each email (POP, IMAP, and SMTP protocols), all HTTP contents, each VoIP call (SIP, MGCP, MEGACO, RTP), IRC, WhatsApp... Xplico is able to classify more than 140 (application) protocols. Xplico cam be used as sniffer-decoder if used in "live mode" or in conjunction with netsniff-ng. Xplico is used...
    Downloads: 17 This Week
    Last Update:
    See Project
  • 3
    CMD Plot Tool

    CMD Plot Tool

    Calculates and plots Colour Magnitude Diagrams from Astronomical data

    CMD Plot Tool calculates and plots Colour Magnitude Diagrams (CMDs) from astronomical photometric data, e.g. of a star cluster observed in two filter bandpasses. It can handle multiple file formats (plain text, DAOPHOT .mag files, ACS Survey of Galactic Globular Clusters .zpt files) to generate professional and customised plots, without the usual steep learning curve. Development using Python, OOP and a formal software development lifecycle model (Agile) allowed for the creation...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4

    Arabic Corpus

    Text categorization, arabic language processing, language modeling

    The Arabic Corpus {compiled by Dr. Mourad Abbas ( http://sites.google.com/site/mouradabbas9/corpora ) The corpus Khaleej-2004 contains 5690 documents. It is divided to 4 topics (categories). The corpus Watan-2004 contains 20291 documents organized in 6 topics (categories). Researchers who use these two corpora would mention the two main references: (1) For Watan-2004 corpus ---------------------- M. Abbas, K. Smaili, D. Berkani, (2011) Evaluation of Topic Identification Methods on...
    Leader badge
    Downloads: 10 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 5

    TimingDrawer

    Text based timing diagram generator

    This tool generates timing diagrams for documenting hardware design. It reads the description from a text file with a simple syntax. It generates vector graphic (EPS, SVG or EMF format). It can be used in command line mode or with a GUI. It is written in Python and works on any platform.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6

    Presage

    the intelligent predictive text entry platform

    Presage (formerly Soothsayer) is an intelligent predictive text entry system. Presage generates predictions by modelling natural language as a combination of redundant information sources. Presage computes probabilities for words which are most likely to be entered next by merging predictions generated by the different predictive algorithms. Presage's modular and extensible architecture allows its language model to be extended and customized to utilize statistical, syntactic, and semantic...
    Leader badge
    Downloads: 15 This Week
    Last Update:
    See Project
  • 7

    dadosSemiotica

    Collecter and manager of semiotica annalisis data

    This program is a web application to collect and organize data of text analysis. It works with sets of texts and the analysis are done on portions of the length of a sentence. One of the preprocessing modules is based on CoGroo (A LibreOffice & OpenOffice.org Portuguese Grammar Checker).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    aioulinux

    aioulinux

    Linux for Arduino and Makers developers

    Hello, I'm the Aioulinux founder, eager to professionally revive the project. Since 2018, the demand for an IoT and Arduino-tailored environment has been evident. Seeking partners for a 2024 version targeting schools and IoT companies, aiming for a secure and comprehensive platform. If you share this vision and wish to collaborate, reach out. Let's revive Aioulinux stronger than ever! Now seeking partners: Live Distro Specialist: Expert in live distributions to ensure...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    TEACUP

    TEACUP

    TCP Experiment Automation Controlled Using Python

    TEACUP automates many aspects of running TCP performance experiments in a specially-constructed physical testbed. TEACUP enables repeatable testing of different TCP algorithms over a range of emulated network path conditions, bottleneck rate limits and bottleneck queuing disciplines. TEACUP utilises a text-based configuration file to define experiments as combinations of parameters specifying desired network path and end host conditions. When multiple values are provided (e.g. for TCP...
    Downloads: 1 This Week
    Last Update:
    See Project
  • No-Nonsense Code-to-Cloud Security for Devs | Aikido Icon
    No-Nonsense Code-to-Cloud Security for Devs | Aikido

    Connect your GitHub, GitLab, Bitbucket, or Azure DevOps account to start scanning your repos for free.

    Aikido provides a unified security platform for developers, combining 12 powerful scans like SAST, DAST, and CSPM. AI-driven AutoFix and AutoTriage streamline vulnerability management, while runtime protection blocks attacks.
    Start for Free
  • 10

    TEES

    Turku Event Extraction System

    Turku Event Extraction System (TEES) is a free and open source natural language processing system developed for the extraction of events and relations from biomedical text. It is written mostly in Python, and should work in generic Unix/Linux environments. Currently, the TEES source code repository still remains on GitHub at http://jbjorne.github.com/TEES/ where there is also a wiki with more information.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    CorNetMap

    A tool for Gene Expression Correlation Network

    Capabilities of CorNetMap: 1. Read data as tab-delimited text file. Can be used for analysis of any data set beyond gene expression. 2. Capable of both two-dimensional and multidimensional data analysis. 3. Calculate Pearson correlation and cross-correlation for analysis data with phase difference. 4. Generate correlation Heat-map and draws network map. 5. Save correlation data as text file. How to use and doccumentation: https://sourceforge.net/projects/cornetmap/files...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    BioC

    We describe a simple XML format to share text documents and annotation

    A minimalist approach to share text documents and data annotations. Allows a large number of different annotations to be represented. Project files contain: - simple code to hold/read/write data and perform sample processing. - BioC-formatted corpora - BioC tools that work with BioC corpora BioC goals - simplicity - interoperability - broad use - reuse There should be little investment required to learn to use a format or a software module to process that format. We...
    Leader badge
    Downloads: 8 This Week
    Last Update:
    See Project
  • 13
    Buildes

    Buildes

    A designer’s program for describing parts of the building

    Buildes is an integrated development environment that assists the user in creating building information. It reads a text (session) file from which it compiles the information. It then allows the user to browse, analyze and export the resulting building knowledge. The compilation system is written in pure Python. The building components created are rendered in PythonOCC. The GUI is written using PyQt.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    orbkit (Moved to Github)

    orbkit (Moved to Github)

    A Modular Python Toolbox for Cross-Platform Post-Processing of Quantum

    PLEASE NOTE ORBKIT HAS BEEN MOVED TO https://github.com/orbkit/orbkit orbkit is a parallel Python program package for post-processing wave function data extracted from output files of MOLPRO (Molden File Format), TURBOMOLE (AOMix file format), GAMESS-US, PROAIMS/AIMPAC (wfn/wfx file format), and Gaussian (Output File and Formatted Checkpoint File) output files. Futhermore, an interface to cclib, a parser for quantum chemical logfiles, is provided. If you use orbkit in your work, please...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Part-of-speech tagging is the task of assigning symbols from a particular set to words in a natural language text. ACOPOST implements and extends well-known machine learning techniques and provides a uniform environment for testing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    SciEnPlot

    SciEnPlot

    Data Plotting and Analysis for Science and Engineering

    ... for the labels in Graph - Accept txt(Text) and csv(Comma separated values) formatted data - Save, copy, print Graph - Use spread sheets to display data - Textboard to organize the results - Graphs in a publishable quality - Source codes based on: Python Numpy Scipy Matplotlib WxPython Visvis etc. - Special functions - Drag and drop data files - Python console is back (since v1.3.3), capable of reusing column data - Debye and Guinier models for SANS / SAX data - More apps in our Web below
    Leader badge
    Downloads: 3 This Week
    Last Update:
    See Project
  • 17

    mwetoolkit

    THIS PROJECT MIGRATED TO https://gitlab.com/mwetoolkit/mwetoolkit3/

    ... to virtually any text collection, language, and MWE type. It is a command-line tool written mostly in Python. Its development started in 2010 as a PhD thesis but the project keeps active (see the SVN logs). Up-to-date documentation and details about the tool can be found on the mwetoolkit website: http://mwetoolkit.sourceforge.net/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    AsiEs stands for Asistente de Escritura (writing assistant). It provides word prediction and autocomplete for fast writing. Thought for people with difficulties writing on keyboard, improves the writing speed preventing the user from pressing at most 50% of keys to write and avoids ortographic errors. Made by Fundación Teletón Uruguay (http://www.teleton.org.uy/home/)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    DEAPathways

    Differential Expression Analysis for Pathways

    This project contains the source code associated with the PLoS Computational Biology publication: "Differential Expression Analysis for Pathways". The paper text can be found here: http://www.ploscompbiol.org/article/info%3Adoi%2F10.1371%2Fjournal.pcbi.1002967
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20

    iMir

    Integrated pipeline for HT miRNA-Seq data analysis

    ... pipeline, called iMir, integrating multiple open source modules and resource in an automated workflow, devising different statistical approaches to analyze data rigorously. iMir comprises also a Graphical User Interface (GUI), so that the pipeline is particularly suited for biologist and early stage bioinformaticians and produces both graphics and text outputs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Mishkal: Arabic Text Vocalization

    Mishkal: Arabic Text Vocalization

    Arabic Text Vocalization system

    Automatic system of vocalization of arabic text.
    Downloads: 86 This Week
    Last Update:
    See Project
  • 22
    SAMoCAD

    SAMoCAD

    Simple CAD program

    SAMoCAD - its program to create simple drawings, partially supports DXF. Main features: - create simple objects (line, arc, circle) - create complex objects (text, dimensions) - edit drawn objects - save drawing to SVG format - output drawing in PostScript format - export/import content DXF files (primitives LINE, CIRCLE, ARC, TEXT, DIMENSION) Program written in Python 2 and use the library Tkinter. Program requires no installation, but you need install Python 2.7 or later on your...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23

    quichem

    Utilities for fast input of chemical formulae and equations.

    *** New releases of quichem will only be available on GitHub. *** quichem is a utility written in pure Python allowing for fast input and parsing of chemical formulae and equations. Entering data does not require any modifier keys or keystrokes to be pressed. This SourceForge page provided pre-built Windows binaries of quichem-pyside. **To run quichem on Mac OS X or Linux**, or with the wxPython front-end, please download the source from quichem's GitHub page at: github.com/spamalot...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Russian morphology tagger. Parses text(s) and output xml representation of text(s) with grammatical annotation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25

    t2t-pipe

    automatic alignment pipeline for parallel treebanks

    The *Tree-to-Tree (t2t) Alignment Pipe* is a collection of python scripts, co-ordinating the process of automatic alignment of parallel treebanks from plain text files with a single call from a unix command line. Supported Languages: DE, FR, EN
    Downloads: 0 This Week
    Last Update:
    See Project
Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.