Showing 386 open source projects for "python text parser"

View related business solutions
  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    Build gen AI apps with an all-in-one modern database: MongoDB Atlas

    MongoDB Atlas provides built-in vector search and a flexible document model so developers can build, scale, and run gen AI apps without stitching together multiple databases. From LLM integration to semantic search, Atlas simplifies your AI architecture—and it’s free to get started.
    Start Free
  • Build Securely on Azure with Proven Frameworks Icon
    Build Securely on Azure with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 1

    TISP

    A simple, LISP-like programming language.

    ThyLordRoot's Interpreted S-expression Parser (TISP) is a LISP-like programming language developed for the benefit of ThyLordRoot's viewers on Twitch.tv. This heritage was chosen for its simplicity and affordance of rapid prototyping. Although it is a toy language, it is intended to be Turing complete, and bootstrapping TISP is a major goal.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2

    Klonwerk

    advanced variability management tool

    Klonwerk is an advanced and easy-to-use tool for reactive variability management and large-scale software reuse. It allows you to automatically clone and modify directories of textural or binary files (e.g. code, architecture, documentation), based on conceptual feature/decision models.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Jericho HTML Parser is a java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 4
    Knwl.js

    Knwl.js

    A .js library for parsing text for specific information

    Knwl.js is a Javascript library that parses through text for dates, times, phone numbers, emails, places, and more. Parser plugins are what make Knwl.js tick and give it new parsing abilities. There are many already available under ./default_plugins for a number of tasks. If you're interested in developing plugins for Knwl.js, ./plugin_development.md is a great place to start. You can optionally specify a language in the first parameter. This helps plugins identify and better suit particular...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Build Securely on AWS with Proven Frameworks Icon
    Build Securely on AWS with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 5
    Pootle, Virtaal & Translate Toolkit

    Pootle, Virtaal & Translate Toolkit

    Localization tools built by localizers for localizers

    Tools for localization: - Pootle: web based translation management system. - Virtaal: Computer Aided Translation (CAT) tool. - Translate Toolkit: QA, format conversion and support (PO, Java .properties, OpenOffice, Mozilla, XLIFF, TMX, TBX, CSV, Qt .ts).
    Leader badge
    Downloads: 81 This Week
    Last Update:
    See Project
  • 6
    That project aims at providing a clean API, and the corresponding C++ implementation, for parsing travel-focused requests (e.g., "washington dc beijing monday r/t +aa -ua 1 week 2 adults 1 dog").
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    go_spider

    go_spider

    An awesome Go concurrent Crawler(spider) framework

    ... will converse to json. Text form represents plain text content without a parser. The PageProcesser moduler only parse results. The moduler gets results(key-value pairs) and URLs to be crawled next step. These key-value pairs will be saved in PageItems and urls will be pushed in Scheduler.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Qfsm

    Qfsm

    A graphical Finite State Machine (FSM) designer.

    A graphical tool for designing finite state machines and exporting them to Hardware Description Languages, such as VHDL, AHDL, Verilog, or Ragel/SMC files for C, C++, Objective-C, Java, Python, PHP, Perl, Lua code generation.
    Leader badge
    Downloads: 14 This Week
    Last Update:
    See Project
  • 9

    isbntools

    A command line tool to extract, transform and get metadata for ISBNs

    As of 2015-06-02, this project is no longer under active development.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Photo and Video Editing APIs and SDKs Icon
    Photo and Video Editing APIs and SDKs

    Trusted by 150 million+ creators and businesses globally

    Unlock Picsart's full editing suite by embedding our Editor SDK directly into your platform. Offer your users the power of a full design suite without leaving your site.
    Learn More
  • 10
    ... of an "assertion" mechanism, it auto-generates a log of the GUI appearance and changes to it. The point is then to use that as a baseline for text-based testing, using e.g. TextTest. It also includes support for instrumenting code so that "waits" can be recorded, making it far easier for a tester to record correctly synchronized tests without having to explicitly plan for this.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11

    HTML XHTML Parser + XPath

    Delphi HTML XHTML Parser +XPath

    Delphi HTML Parser This module lets you work with HTML documents as DOM tree and use XPath for searching tags. It is very simple way to parse HTML. This tested with version Delphi XE5,6 Usage Add in Uses parser.pas; begin HtmlTxt:= ''; //here your html NodeList:= TNodeList.Create; ValueList:= TStringList.Create; DomTree:= TDomTree.Create; DomTreeNode:= DomTree.RootNode; If DomTreeNode.RunParse(HtmlTxt) then begin {your code example
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    PyPE (Python Programmers Editor) was written in order to offer a lightweight but powerful editor for those of you who think emacs is too much and idle is too little. Syntax highlighting is included out of the box, as is multiple open documents via tabs.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 13
    Human Speakable Programming Language

    Human Speakable Programming Language

    foundation of the General Intelligence Operating System

    HSPL is Human Speakable Programming Language, allowing for communication between human-to-computer and human-to-human in the same language. This project has moved to http://sourceforge.net/p/spel We are currently working on human-to-computer programming-language with mostly English base vocabulary. Though once we have that, we plan to add support for other world Languages, including Chinese, Spanish, Russian, Arabic, Hindi, among others. Eventually HSPL shall be the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    UA-Parser

    UA-Parser

    A multi-language port of Browserscope's user agent parser

    ua-parser is a multi-language library for parsing User-Agent strings into structured data, such as browser, OS, and device information. Originally created for web analytics and user tracking, it supports a wide range of platforms and is available in JavaScript, Python, Java, and other languages. ua-parser is often used in server logs, analytics dashboards, and A/B testing systems to segment traffic.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    snipMate & UltiSnip Snippets

    snipMate & UltiSnip Snippets

    vim-snipmate default snippets

    This repository contains snippets files for various programming languages. It is community-maintained and many people have contributed snippet files and other improvements already. There are different forks of snippet engines that allow the user to insert snippets by typing the name of a snippet hitting the expansion mapping. UltiSnips has additional features such as high speed, nesting snippets, expanding snippets in snippets and offers powerful transformations on text in snippets (like visual...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    pyz80 is a z80 cross-assembler written entirely in Python. It accepts text files with the syntax of the Sam Coupe's popular COMET assembler, and outputs a disk image suitable for loading in SimCoupe or transferring to floppy disk for use with a real Sam.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    pyWeb Literate Programming Tool

    Literate Programming in pure Python

    pyWeb is a Literate Programming tool that will work with any markup language and any programming langauge. The idea is to allow you to create great documentation with as constraints or limitations.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    IronMeta is an implementation of Alessandro Warth's OMeta metaprogramming system in C#. It provides a packrat parser generator that generates parsers for Parsing Expression Grammars that operate on arbitrary streams of objects.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19

    sitecheck

    Modular web site spider for web developers.

    More than just a link checker, sitecheck is a website spider (also known as a crawler) which can assist with SEO by testing an entire site plus both inbound links from search engines and outbound links to other sites for the following issues: looping redirects (HTTP 301/302), broken links (HTTP 404), server errors (HTTP 500), spelling mistakes, low readability scores (using the Flesch Reading Ease test), missing/empty/duplicate meta tags, duplicate content, slow page speed, W3C validation...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20

    SVNStartCommitHelper

    Useful form to support SVN Commits as an SVN Start-Commit Hook Script

    Professional environments focus on high development standards in Source Code Management. E.g. by usage of server side commit hooks to check for minimum acceptance levels on code and documentation quality including commit message structure and content. TortoiseSVN offers only a free form text field to edit inside the Commit Dialog. Developers might recall situations when struggling with commit message structure and fighting the server side commit hooks instead of focusing on message content...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    meld-installer

    meld-installer

    Meld Installer for Windows

    Bundles Portable Python (with PyGTK) and Meld together in an easy to use installer. This allows you to not have to worry about setting up Python or PyGTK and you can keep Meld's Python separate from other Python installations on your machine. ** NOTE ** Meld 3.11 and later now have official installers, hence this project is no longer supported. You can download the new installer here: https://download.gnome.org/binaries/win32/meld/. You should uninstall the old 1.8 version before upgrading.
    Downloads: 30 This Week
    Last Update:
    See Project
  • 22
    TextBlob

    TextBlob

    TextBlob is a Python library for processing textual data

    Simple, Pythonic, text processing, Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more. It provides a simple API for diving into common natural language processing (NLP) tasks such as part-of-speech tagging, noun phrase extraction, sentiment analysis, classification, translation, and more. TextBlob stands on the giant shoulders of NLTK and pattern, and plays nicely with both. Supports word inflection (pluralization and singularization) and lemmatization...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 23
    DrPython is a highly customizable cross-platform ide to aid programming in Python. It was developed with teaching in mind, and has a clean, simple interface. It is written in Python, using wxPython as the gui.
    Downloads: 12 This Week
    Last Update:
    See Project
  • 24

    Yacap

    Yet Another Command Argument Parser

    A highly configurable command line parser with help generation support. * Easy to configure option sets based on builder patterns * Typed options: boolean, string, int, long, double and enum * Dual POSIX short and GNU long parameter names * Support for unnamed (such as file list after options) and internal options (options are unparsed after --) * One step parsing * Result set with safe and typed getters * Error messages and warnings returned by result set * Text-based help...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Python Tkinter Extensions

    Python Tkinter Extensions

    Additional Graphical Classes and Widgets for Tkinter/Ttk

    This is a Python package with extensions for Tkinter. It supports both Python 2.7 and Python 3.3. Currently, the project is in version 1.0 of the Production stage. Feel free to fork off the main project or send me possible additions to the package.
    Downloads: 1 This Week
    Last Update:
    See Project
Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.