Showing 30 open source projects for "python text parser"

  • 1
    AquaTerm is a Mac OS X graphics renderer. It allows command line applications written in ObjC, C, FORTRAN, Lisp, Perl or Python to display vector graphics, text and images using a simple API. Adapters for gnuplot, PGPLOT, and PLplot exist as well.
    Downloads: 6 This Week
  • 2
    Hacker Scripts

    Based on a true story

    Hacker Scripts is a cheeky collection of small automation scripts and language ports collected under the tagline “Based on a true story.” The repository gathers playful utilities (originally shell and Ruby scripts) that automate short, real-world tasks — for example, sending a quick “late at work” text when SSH sessions are active, firing off an automated “I’m sick / working from home” email on certain mornings, or even talking to a networked coffee machine to start brewing at precisely...
    Downloads: 9 This Week
  • 3

    Safe Harbor Deidentification

    Safe Harbor Deidentification for medical documents

    The Safe Harbor Deidentification mode of Phalanx is an abridged pipeline of NLP annotators culminating in NER annotators that write text offsets as output. It uses the Safe Harbor deidentification method.
    Downloads: 0 This Week
  • 4
    aioulinux

    Linux for Arduino and Makers developers

    Hello, I'm the Aioulinux founder, eager to professionally revive the project. Since 2018, the demand for an IoT and Arduino-tailored environment has been evident. Seeking partners for a 2024 version targeting schools and IoT companies, aiming for a secure and comprehensive platform. If you share this vision and wish to collaborate, reach out. Let's revive Aioulinux stronger than ever! Now seeking partners: Live Distro Specialist: Expert in live distributions to ensure...
    Downloads: 2 This Week
  • 5
    htmlarea

    Small, powerful, full featured WYSIWYG editor

    HTMLArea 4 is a browser-based WYSIWYG editor that easily replaces the TEXTAREA in your web pages. It is written in JavaScript and is suitable for use in any modern web browser and on any page of your web site. The current version is 4.0-2016-08-29.
    Downloads: 2 This Week
  • 6

    BioC

    We describe a simple XML format to share text documents and annotations

    A minimalist approach to share text documents and data annotations. Allows a large number of different annotations to be represented.

    Project files contain:
    - simple code to hold/read/write data and perform sample processing
    - BioC-formatted corpora
    - BioC tools that work with BioC corpora

    BioC goals:
    - simplicity
    - interoperability
    - broad use
    - reuse

    There should be little investment required to learn to use a format or a software module to process that format. We...
    Downloads: 8 This Week
  • 7
    Part-of-speech tagging is the task of assigning symbols from a particular set to words in a natural language text. ACOPOST implements and extends well-known machine learning techniques and provides a uniform environment for testing.
    Downloads: 0 This Week
  • 8

    iMir

    Integrated pipeline for HT miRNA-Seq data analysis

    ... pipeline, called iMir, integrating multiple open source modules and resources in an automated workflow, devising different statistical approaches to analyze data rigorously. iMir also includes a Graphical User Interface (GUI), so the pipeline is particularly suited to biologists and early-stage bioinformaticians, and it produces both graphics and text outputs.
    Downloads: 0 This Week
  • 9
    UA-Parser

    A multi-language port of Browserscope's user agent parser

    ua-parser is a multi-language library for parsing User-Agent strings into structured data, such as browser, OS, and device information. Originally created for web analytics and user tracking, it supports a wide range of platforms and is available in JavaScript, Python, Java, and other languages. ua-parser is often used in server logs, analytics dashboards, and A/B testing systems to segment traffic.
    Downloads: 0 This Week
  • 10

    Nehul Singh IDE for .Net

    NS-ORG IDE for DOT NET Programming

    ..., clipboard mapping and, best of all, it can convert VB.Net code to Python, BOO, C#, or Ruby and vice versa.
    Downloads: 0 This Week
  • 11
    Plotmeister is a data exploration tool. It parses your ASCII data and generates a simple (text-based) table format. You can modify this table and eventually create nice looking figures.
    Downloads: 0 This Week
  • 12
    A set of Unix command line tools for quick and convenient batch processing of tabular text files (a.k.a., tab-delimited, csv, or flat file format) with a header line. Provides delimiter and compression detection, column reference by name.
    * tblmap: per-line ("map") computation: derive columns through an expression, delete, reorder, filter rows.
    * tblred: compute ("reduce") aggregations (e.g., sum, average) over groups defined by key columns.
    * tbldesc: Summarize columns in file (e.g...
    Downloads: 0 This Week
  • 13

    Arabeyes

    A project to increase free Arabic fonts on Unix/Linux

    Arabeyes is a meta project aimed at fully supporting the Arabic language in the Unix/Linux environment. It is designed to be a central location to standardize the Arabization process. Arabeyes relies on voluntary contributions.
    Downloads: 197 This Week
  • 14
    The goal of this project is to create a user-friendly application with which users can save their page numbers for e-books they are reading on their computers or other devices.
    Downloads: 1 This Week
  • 15
    Sofu is a simple Unicode-based plain text file format designed for easily accessing and storing application data, such as game databases and configuration files. This project also hosts various Sofu parser libraries.
    Downloads: 0 This Week
  • 16
    MutationFinder is a biomedical natural language processing (NLP) system for extracting mentions of point mutations from free text. As an information extraction system, MutationFinder achieves high performance (99% precision, 81% recall on blind test data).
    Downloads: 0 This Week
  • 17
    Tomoe is a handwriting character recognition engine.
    Downloads: 19 This Week
  • 18
    Small XML-to-ASCII parser written in Perl as an OO-Perl module. Can be used to extract pure ASCII text from a given XML document. Planned for later: extracting text from web documents, or from given arrays and hashes.
    Downloads: 0 This Week
  • 19
    Luci is an accessibility tool (an HTML parser written in Perl) designed to provide users with a text-only experience of a web site. Inspired by the Betsie project, Luci offers features such as SSL, cookies, frames, 401 authorization (i.e., htaccess), etc.
    Downloads: 0 This Week
  • 20
    Berkeley Yacc is an LALR(1) parser generator. Berkeley Yacc has been made as compatible as possible with AT&T Yacc. Berkeley Yacc has been extended to generate Python, Perl and Java code.
    Downloads: 0 This Week
  • 21
    The AbiWord word processor is a full-featured cross-platform word processor.
    Downloads: 82 This Week
  • 22
    plaindb is a plain text database package. It comprises the file format and a set of modules in Perl, Tcl, and Python to access the databases, plus a basic shell interface. Higher-level interfaces are also included for local (GUI) and web access.
    Downloads: 0 This Week
  • 23
    This is a parser which reads plain-text input files and generates HTML output files. It combines the presentation features of HTML with the simplicity of plain-text notes. Generates HTML index files and hyperlinks for the words you choose to index.
    Downloads: 0 This Week
  • 24
    Projek Munsyi is a project to provide localization (L10N) for Bahasa Melayu (the Malay language, code ms), especially for GNU software. The goal of this project is to become the hub for the Malay language team at the Translation Project.
    Downloads: 0 This Week
  • 25
    Spatial Disruption is a web-based strategy warfare game. This project houses both Version 1 (Perl, C++, and flat text files) and Version 2 (PHP, Python, C++, and various SQL databases). Our game already features a 2-D map, built-in alliances, and more!
    Downloads: 0 This Week