Showing 21 open source projects for "python text parser"

View related business solutions
  • Keep company data safe with Chrome Enterprise Icon
    Keep company data safe with Chrome Enterprise

    Protect your business with AI policies and data loss prevention in the browser

    Make AI work your way with Chrome Enterprise. Block unapproved sites and set custom data controls that align with your company's policies.
    Download Chrome
  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    The database for AI-powered applications.

    MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.
    Start Free
  • 1
    jsoup

    jsoup

    Java library for working with real-world HTML

    jsoup is a Java library for working with real-world HTML. It provides a very convenient API for fetching URLs and extracting and manipulating data, using the best of HTML5 DOM methods and CSS selectors. jsoup implements the WHATWG HTML5 specification, and parses HTML to the same DOM as modern browsers do. jsoup is designed to deal with all varieties of HTML found in the wild; from pristine and validating, to invalid tag-soup; jsoup will create a sensible parse tree. The parser will make...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2

    xsd2pgschema

    Relational database replication tool based on XML Schema

    xsd2pgschema is a Java application suite, which converts XML Schema 1.1 (hierarchical data model) to PostgreSQL DDL (relational data model) and supports XML data migration into PostgreSQL based on the XML Schema without defects on information content. It also supports full-text indexing via either Apache Lucene or Sphinx Search utilizing the relational data model. File conversion from XML to CSV, TSV, or JSON is possible as well as mapping XML Schema to JSON Schema. Obtained PostgreSQL...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 3

    JSONjuicer

    JSON parser and encoder

    A Java open-source library which makes encoding and decoding Java data-structures to and from JSON text easy and intuitive.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Jericho HTML Parser is a java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML.
    Downloads: 3 This Week
    Last Update:
    See Project
  • Level Up Your Cyber Defense with External Threat Management Icon
    Level Up Your Cyber Defense with External Threat Management

    See every risk before it hits. From exposed data to dark web chatter. All in one unified view.

    Move beyond alerts. Gain full visibility, context, and control over your external attack surface to stay ahead of every threat.
    Try for Free
  • 5
    Human Speakable Programming Language

    Human Speakable Programming Language

    foundation of the General Intelligence Operating System

    HSPL is Human Speakable Programming Language, allowing for communication between human-to-computer and human-to-human in the same language. This project has moved to http://sourceforge.net/p/spel We are currently working on human-to-computer programming-language with mostly English base vocabulary. Though once we have that, we plan to add support for other world Languages, including Chinese, Spanish, Russian, Arabic, Hindi, among others. Eventually HSPL shall be the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    A java-based parser for parsing/grabbing web sites and other text or XML documents, based on a nondeterministic parser language, creating XML output. Also contains a few utility classes for HTML, CSV and text parsing, and additional character sets.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    Simple Java delimited and fixed width file parser. Handles CSV, Excel CSV, Tab, Pipe delimiters, just to name a few. Maps column positions in the file to user friendly names via XML. See "FlatPack Feature List" under News for complete feature list.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Easy Equations

    Easy Equations

    Hand Written Equation Creator

    Easy Equations is a utility using which one can write the mathematical equations. The focus of this utility is to provide user friendly access to write mathematical equations which is helpful for students, lecturers, mathematicians and Research persons who prefer using mathematical equations in a document, PowerPoint or web sites. Works on Windows as well as Linux platforms. Software Requirements: JDK 7 or higher. Linux Platform with kernel version 2.7 or higher.(for Linux users).python...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    Stand-alone Java library implementing parser/formatter/comparator/validator for JSON/XML-like text formats oriented on JSON-like object model (list,map,scalar + reflection). Library is designed to maximize adaptivity via set of extendable modules.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 10

    ExiProcessor

    Command-line program for processing Efficient XML Interchange (EXI)

    ExiProcessor is a command-line program that encodes text XML files into binary EXI and decodes EXI files into XML. It uses the open source Java-based library EXIficient (http://exificient.sourceforge.net) as the EXI parser. In essence, ExiProcessor is a command-line interface to EXIficient. ExiProcessor can help people learn about the various EXI encoding and decoding options and how those options affect compression ratios.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11

    X2S

    A simple DOM parser in Java

    X2S is a very simple DOM parser. The aim is to retrieve attributes and text-values from an XML DOM document in the case that you don't need all features and complexity of XPath and/or XQuery. For example, if it could be useful a method to retrieve all text-values from a DOM relative path like: /root-node/persons/person/name or the attributes from a string like: /root-node/persons/person/id
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12

    knowceans

    Utility classes from maps to search engine to random samplers

    .... --- Highlights: --- org.knowceans.util: IndexQuickSort, TableList: apply order of one array/list to others +++ Vectors, ArrayUtils: array convenience +++ RandomSamplers, CokusRandom, ArmSampler, Densities: random sampling and distributions +++ Arguments: command line parser +++ StopWatch, Which, ExternalProcess: runtime stuff +++ ParallelFor: OpenMP workalike +++ PatternString, NamedGroupRegex: regex convenience --- org.knowceans.corpus: CorpusSearcher: full-text search engine +++ LabelNumCorpus: svmlight corpus storage and filtering +++ NIPS corpus with text, authors, labels and citations --- org.knowceans.map: InvertibleHashMultiMap, BijectiveHashMap: implement n:m and 1:1 relations. --- Other libs: knowceans-arms = port of the Adaptive Rejection Metropolis Sampler (ARMS) for arbitrary distributions +++ lda-j = port of lda-c, implementing Latent Dirichlet Allocation (LDA)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Picxml is a tool for xml files management. Used as command: Picxml is a shell friendly, xpath like query tool. Used as library: picxml, is a full featured xml Parser, wich allows you to navigate, find/create/update/delete/print nodes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    The aim of this project is to develop a Portable Document Format (PDF) importer for OpenOffice.org Writer based on XPDF. This project was inspired by the PDF importer within KWord.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Some text parser, that transforms a custom TeX-like syntax into XML. Un parseur, qui produit du XML à partir d'une syntaxe ressemblant à TeX.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    LiMa means Lightweight Markup Language. It is a parser for an easy to use ASCII/Text-based markup - comparable to Markdown or the Wikipedia-Markup language with special configurable extensions in defining Links and image-resources.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Markout is a pure-Java lightweight wiki markup parser based on John Gruber's Markdown.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    This is a small command-line program to split a phone bill into several seperate ones to categorise the calls (e.g. for illustration which family member phones how much). It generates HTML files and is able to send them to specified email adresses.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Xion (pronounced "zaion") is a text-based serialization format of object states. Xion extends the JSON format by permitting "tagged" notations. This project provides an implementation of Xion in Java, including the object model, parser and utilities.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    A java library for parsing Self-Defining Text Archive and Retrieval (STAR) files. These form the basis of CIF and mmCIF formats.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Java flat text parser. Parse any delimited or fixed width strings. Bind user friendly column names to text positions via XML. Easily parse strings, dates, doubles, integers, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next