Showing 22 open source projects for "python text parser"

  • 1
    jsoup

    Java library for working with real-world HTML

    jsoup is a Java library for working with real-world HTML. It provides a convenient API for fetching URLs and for extracting and manipulating data, using the best of HTML5 DOM methods and CSS selectors. jsoup implements the WHATWG HTML5 specification and parses HTML to the same DOM as modern browsers do. It is designed to deal with all varieties of HTML found in the wild: from pristine and validating to invalid tag soup, jsoup will create a sensible parse tree. The parser will make every...
    Downloads: 0 This Week
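To illustrate the idea of extracting data from tag soup, here is a minimal sketch using Python's standard `html.parser` module. It is not jsoup's API and does not support CSS selectors or build a browser-equivalent DOM; the `LinkCollector` class and the sample markup are assumptions made up for this example.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Hypothetical helper: collects href values from <a> tags as they stream by."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs for the start tag.
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)


# Deliberately messy input: <p>, <b>, and <li> are never closed.
collector = LinkCollector()
collector.feed('<p>Unclosed <b>bold <a href="/one">one</a><li><a href="/two">two</a>')
collector.close()
print(collector.links)
```

An event-based parser like this never needs a well-formed tree, which is why unclosed tags do not stop it; a library such as jsoup goes further and repairs the tree itself.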
  • 2

    xsd2pgschema

    Relational database replication tool based on XML Schema

    xsd2pgschema is a Java application suite that converts XML Schema 1.1 (hierarchical data model) to PostgreSQL DDL (relational data model) and supports XML data migration into PostgreSQL based on the XML Schema without loss of information content. It also supports full-text indexing via either Apache Lucene or Sphinx Search using the relational data model. File conversion from XML to CSV, TSV, or JSON is possible, as is mapping XML Schema to JSON Schema. Obtained PostgreSQL...
    Downloads: 6 This Week
  • 3

    JSONjuicer

    JSON parser and encoder

    An open-source Java library that makes encoding and decoding Java data structures to and from JSON text easy and intuitive.
    Downloads: 0 This Week
  • 4
    Jericho HTML Parser is a Java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML.
    Downloads: 5 This Week
  • 5
    Human Speakable Programming Language

    foundation of the General Intelligence Operating System

    HSPL (Human Speakable Programming Language) allows human-to-computer and human-to-human communication in the same language. This project has moved to http://sourceforge.net/p/spel. We are currently working on a human-to-computer programming language with a mostly English base vocabulary. Once we have that, we plan to add support for other world languages, including Chinese, Spanish, Russian, Arabic, and Hindi, among others. Eventually HSPL shall be the...
    Downloads: 0 This Week
  • 6
    Simple Java parser for delimited and fixed-width files. Handles CSV, Excel CSV, tab, and pipe delimiters, to name a few. Maps column positions in the file to user-friendly names via XML. See "FlatPack Feature List" under News for the complete feature list.
    Downloads: 7 This Week
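The idea of mapping column positions to friendly names through an XML file can be sketched with Python's standard library. The XML mapping layout, the function names, and the sample data below are all assumptions invented for this illustration; they are not FlatPack's actual mapping schema or API.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical mapping format (not FlatPack's schema): one <column> per position.
MAPPING_XML = """<mapping>
  <column name="first_name"/>
  <column name="last_name"/>
  <column name="city"/>
</mapping>"""


def load_column_names(mapping_xml):
    """Return column names in document order, i.e. in file-position order."""
    return [col.get("name") for col in ET.fromstring(mapping_xml).iter("column")]


def parse_delimited(text, names, delimiter=","):
    """Parse delimited text and key each field by its mapped column name."""
    reader = csv.reader(io.StringIO(text), delimiter=delimiter)
    return [dict(zip(names, row)) for row in reader]


names = load_column_names(MAPPING_XML)
# Pipe-delimited sample; the quoted field contains the delimiter itself.
rows = parse_delimited('Ada|Lovelace|London\nLinus|Torvalds|"Portland|OR"\n', names, delimiter="|")
print(rows)
```

Keeping the names in a separate mapping means calling code refers to `city` rather than to column index 2, so a change in file layout only touches the mapping.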
  • 7
    A Java-based parser for parsing/grabbing web sites and other text or XML documents, based on a nondeterministic parser language, creating XML output. Also contains a few utility classes for HTML, CSV and text parsing, and additional character sets.
    Downloads: 1 This Week
  • 8
    Easy Equations

    Hand Written Equation Creator

    Easy Equations is a utility for writing mathematical equations. Its focus is user-friendly equation writing for students, lecturers, mathematicians, and researchers who use mathematical equations in documents, PowerPoint presentations, or web sites. It works on Windows as well as Linux platforms. Software requirements: JDK 7 or higher; Linux platform with kernel version 2.7 or higher (for Linux users). python...
    Downloads: 0 This Week
  • 9
    Stand-alone Java library implementing a parser, formatter, comparator, and validator for JSON/XML-like text formats, oriented toward a JSON-like object model (list, map, scalar + reflection). The library is designed to maximize adaptability via a set of extensible modules.
    Downloads: 0 This Week
  • 10

    ExiProcessor

    Command-line program for processing Efficient XML Interchange (EXI)

    ExiProcessor is a command-line program that encodes text XML files into binary EXI and decodes EXI files into XML. It uses the open source Java-based library EXIficient (http://exificient.sourceforge.net) as the EXI parser. In essence, ExiProcessor is a command-line interface to EXIficient. ExiProcessor can help people learn about the various EXI encoding and decoding options and how those options affect compression ratios. The source code itself can also be used as an example of how...
    Downloads: 0 This Week
  • 11

    knowceans

    Utility classes from maps to search engine to random samplers

    Collection of several multi-purpose Java libraries. --- knowceans-tools = collection of Java utility classes. --- Highlights: --- org.knowceans.util: IndexQuickSort, TableList: apply order of one array/list to others +++ Vectors, ArrayUtils: array convenience +++ RandomSamplers, CokusRandom, ArmSampler, Densities: random sampling and distributions +++ Arguments: command line parser +++ StopWatch, Which, ExternalProcess: runtime stuff +++ ParallelFor: OpenMP workalike +++ PatternString...
    Downloads: 0 This Week
  • 12

    X2S

    A simple DOM parser in Java

    X2S is a very simple DOM parser. It aims to retrieve attributes and text values from an XML DOM document when you don't need all the features and complexity of XPath and/or XQuery. For example, it is useful when you want a method that retrieves all text values from a DOM-relative path such as /root-node/persons/person/name, or the attributes from a path such as /root-node/persons/person/id.
    Downloads: 0 This Week
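The path-based lookup described for X2S can be sketched in a few lines with Python's standard `xml.etree.ElementTree`. This is an illustration of the idea only, not X2S's API; the function names, the convention that the last step of an attribute path names the attribute, and the sample document are assumptions.

```python
import xml.etree.ElementTree as ET

# Sample document invented for this sketch.
XML = """<root-node><persons>
  <person id="1"><name>Ada</name></person>
  <person id="2"><name>Linus</name></person>
</persons></root-node>"""


def _walk(root, steps):
    """Follow element names step by step and return every matching element."""
    if not steps or steps[0] != root.tag:
        return []
    nodes = [root]
    for step in steps[1:]:
        # Iterating an Element yields its direct children.
        nodes = [child for node in nodes for child in node if child.tag == step]
    return nodes


def text_values(root, path):
    """Text of every element reached by a path like /root-node/persons/person/name."""
    steps = path.strip("/").split("/")
    return [(node.text or "").strip() for node in _walk(root, steps)]


def attribute_values(root, path):
    """Attribute values for a path whose last step is the attribute name."""
    *steps, attr = path.strip("/").split("/")
    return [node.attrib[attr] for node in _walk(root, steps) if attr in node.attrib]


root = ET.fromstring(XML)
print(text_values(root, "/root-node/persons/person/name"))
print(attribute_values(root, "/root-node/persons/person/id"))
```

Plain name-by-name descent covers the two cases in the description without predicates, axes, or functions, which is the trade-off against full XPath.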
  • 13
    Picxml is a tool for managing XML files. Used as a command, Picxml is a shell-friendly, XPath-like query tool. Used as a library, Picxml is a full-featured XML parser, which allows you to navigate, find, create, update, delete, and print nodes.
    Downloads: 0 This Week
  • 14
    WSFuzzer
    WSFuzzer is a fuzzing penetration testing tool used against HTTP SOAP based web services. It tests numerous aspects (input validation, XML Parser, etc) of the SOAP target. It is only to be used against targets that have granted permission to be teste
    Downloads: 5 This Week
  • 15
    The aim of this project is to develop a Portable Document Format (PDF) importer for OpenOffice.org Writer based on XPDF. This project was inspired by the PDF importer within KWord.
    Downloads: 0 This Week
  • 16
    A text parser that transforms a custom TeX-like syntax into XML.
    Downloads: 0 This Week
  • 17
    LiMa stands for Lightweight Markup Language. It is a parser for an easy-to-use ASCII/text-based markup, comparable to Markdown or the Wikipedia markup language, with special configurable extensions for defining links and image resources.
    Downloads: 0 This Week
  • 18
    Markout is a pure-Java lightweight wiki markup parser based on John Gruber's Markdown.
    Downloads: 0 This Week
  • 19
    This is a small command-line program that splits a phone bill into several separate ones to categorise the calls (e.g. to show how much each family member phones). It generates HTML files and can send them to specified email addresses.
    Downloads: 0 This Week
  • 20
    Xion (pronounced "zaion") is a text-based serialization format of object states. Xion extends the JSON format by permitting "tagged" notations. This project provides an implementation of Xion in Java, including the object model, parser and utilities.
    Downloads: 0 This Week
  • 21
    A Java library for parsing Self-Defining Text Archive and Retrieval (STAR) files. These form the basis of the CIF and mmCIF formats.
    Downloads: 0 This Week
  • 22
    Java flat text parser. Parse any delimited or fixed width strings. Bind user friendly column names to text positions via XML. Easily parse strings, dates, doubles, integers, etc.
    Downloads: 0 This Week
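Fixed-width parsing with typed fields can be sketched as a table of named character ranges, each with a converter. The layout below is written as a Python list rather than the XML binding the project describes, and every name, width, and sample value is an assumption made up for this illustration.

```python
from datetime import date, datetime

# Hypothetical layout: (column name, start index, end index, converter).
LAYOUT = [
    ("name", 0, 10, str.strip),
    ("joined", 10, 20, lambda s: datetime.strptime(s.strip(), "%Y-%m-%d").date()),
    ("visits", 20, 25, int),      # int() ignores surrounding whitespace
    ("balance", 25, 33, float),   # float() ignores surrounding whitespace
]


def parse_fixed_width(line, layout):
    """Slice each column out of the line by position and convert it to its type."""
    return {name: convert(line[start:end]) for name, start, end, convert in layout}


# Build the sample line from padded pieces so the widths match the layout exactly.
LINE = "Ada".ljust(10) + "2024-01-15" + "42".rjust(5) + "199.50".rjust(8)
record = parse_fixed_width(LINE, LAYOUT)
print(record)
```

Holding the converters next to the positions keeps type handling in one place; a delimited variant would only replace the slicing step with a split.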