Showing 84 open source projects for "python text parser"

View related business solutions
  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    Build gen AI apps with an all-in-one modern database: MongoDB Atlas

    MongoDB Atlas provides built-in vector search and a flexible document model so developers can build, scale, and run gen AI apps without stitching together multiple databases. From LLM integration to semantic search, Atlas simplifies your AI architecture—and it’s free to get started.
    Start Free
  • Keep company data safe with Chrome Enterprise Icon
    Keep company data safe with Chrome Enterprise

    Protect your business with AI policies and data loss prevention in the browser

    Make AI work your way with Chrome Enterprise. Block unapproved sites and set custom data controls that align with your company's policies.
    Download Chrome
  • 1

    ExiProcessor

    Command-line program for processing Efficient XML Interchange (EXI)

    ExiProcessor is a command-line program that encodes text XML files into binary EXI and decodes EXI files into XML. It uses the open source Java-based library EXIficient (http://exificient.sourceforge.net) as the EXI parser. In essence, ExiProcessor is a command-line interface to EXIficient. ExiProcessor can help people learn about the various EXI encoding and decoding options and how those options affect compression ratios.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    Html Assembler
    Html Assembler is a static site generator. It automatically integrates page content such as text and photos in a modifiable page template creating a complete set of html files ready for upload to your site.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3

    X2S

    A simple DOM parser in Java

    X2S is a very simple DOM parser. The aim is to retrieve attributes and text-values from an XML DOM document in the case that you don't need all features and complexity of XPath and/or XQuery. For example, if it could be useful a method to retrieve all text-values from a DOM relative path like: /root-node/persons/person/name or the attributes from a string like: /root-node/persons/person/id
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4

    knowceans

    Utility classes from maps to search engine to random samplers

    .... --- Highlights: --- org.knowceans.util: IndexQuickSort, TableList: apply order of one array/list to others +++ Vectors, ArrayUtils: array convenience +++ RandomSamplers, CokusRandom, ArmSampler, Densities: random sampling and distributions +++ Arguments: command line parser +++ StopWatch, Which, ExternalProcess: runtime stuff +++ ParallelFor: OpenMP workalike +++ PatternString, NamedGroupRegex: regex convenience --- org.knowceans.corpus: CorpusSearcher: full-text search engine +++ LabelNumCorpus: svmlight corpus storage and filtering +++ NIPS corpus with text, authors, labels and citations --- org.knowceans.map: InvertibleHashMultiMap, BijectiveHashMap: implement n:m and 1:1 relations. --- Other libs: knowceans-arms = port of the Adaptive Rejection Metropolis Sampler (ARMS) for arbitrary distributions +++ lda-j = port of lda-c, implementing Latent Dirichlet Allocation (LDA)
    Downloads: 0 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 5
    Picxml is a tool for xml files management. Used as command: Picxml is a shell friendly, xpath like query tool. Used as library: picxml, is a full featured xml Parser, wich allows you to navigate, find/create/update/delete/print nodes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Converts a wide variety of different character encodings in text files to unicode. Batch mode to make converting many documents at once possible.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    WSFuzzer
    WSFuzzer is a fuzzing penetration testing tool used against HTTP SOAP based web services. It tests numerous aspects (input validation, XML Parser, etc) of the SOAP target. It is only to be used against targets that have granted permission to be teste
    Downloads: 6 This Week
    Last Update:
    See Project
  • 8
    Multiple format bibliography processor in Python.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    creole/c is a Wiki Creole parser and a HTML converter. It implements Wiki Creole 1.0 and almost all of its additions. The parser is written in C++ and has a simple event driven plain C API. The converter is a stand-alone console application.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Cloud-based help desk software with ServoDesk Icon
    Cloud-based help desk software with ServoDesk

    Full access to Enterprise features. No credit card required.

    What if You Could Automate 90% of Your Repetitive Tasks in Under 30 Days? At ServoDesk, we help businesses like yours automate operations with AI, allowing you to cut service times in half and increase productivity by 25% - without hiring more staff.
    Try ServoDesk for free
  • 10
    This is a pure-Python XPath evaluator based on ElementTree. It supports a substantial fraction of the XPath 1.0 specification, but only the self, child, and attribute axes. The parser underlying the evaluator attempts to handle all of XPath 1.0.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    A parser for patch (diff) files written as a Python module around the GNU patch code.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    A python package for interpreting METAR coded weather reports
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Advanced PHP library to pase xml documents to array with many advanced options.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    A program to assist in writing Arabic text in programs that don't support Arabic, such as Photoshop (the non-ME editions), Scribus, Flash, and others. Source code is hosted on github: http://github.com/hasenj/arabic-writer
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    A JavaScript library for parsing Creole 1.0 wiki markup.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Contains a LaTeX style file and an associated GUI that allow for the annotation of LaTeX documents. Tracks changes made by multiple editors. This package provides a way for multiple authors to collaboratively edit a latex document.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Sofu is a simple Unicode-based plain text file format designed for easily accessing and storing application data, such as game databases and configuration files. This project also hosts various Sofu parser libraries.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Txt2tags converts a text file with minimal markup to HTML, XHTML, SGML, LaTeX, Lout, UNIX Man Page, Wikipedia, Google Code Wiki, DokuWiki, MoinMoin, MagicPoint(mgp), PageMaker. Features: simple, fast, automatic TOC, macros, filters, include, GUI/CLI/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    ZML, the Zeitung Markup Language, is a simple CMS for small newspapers. It was specifically designed to publish a student newspaper in print and on the Web. It uses LaTeX and XHTML. So far, it is documented in German only.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Some text parser, that transforms a custom TeX-like syntax into XML. Un parseur, qui produit du XML à partir d'une syntaxe ressemblant à TeX.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Pybtex is a drop-in replacement for BibTeX written in Python.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    csvtoxml will convert parse csv comma separated value data into xml. a command line console utility that uses stdin and stdout pipe with more cat, pr, wget, zip, find -exec for added functionality. file stream term c c++ small fast parser unix win osx
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    LiMa means Lightweight Markup Language. It is a parser for an easy to use ASCII/Text-based markup - comparable to Markdown or the Wikipedia-Markup language with special configurable extensions in defining Links and image-resources.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Markout is a pure-Java lightweight wiki markup parser based on John Gruber's Markdown.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Wtparser is a lex/yacc-based parser for wikitext as defined by mediawiki.
    Downloads: 0 This Week
    Last Update:
    See Project