Showing 98 open source projects for "python text parser"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    A simple text process library, aims to assist parsing all kinds of text including plain text, XML, HTML, etc., which means it can be used as a simple XML parser or a HTML parser.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Stand-alone Java library implementing parser/formatter/comparator/validator for JSON/XML-like text formats oriented on JSON-like object model (list,map,scalar + reflection). Library is designed to maximize adaptivity via set of extendable modules.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Meresco is both an OAI Data Provider and a Service Provider. SourceForge is only used to host the source control (subversion). Sources: http://sources.meresco.org/ Binaries: http://repository.cq2.org/ Mail: http://groups.google.com/group/meresco
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Wiko, the wiki compiler, compiles wiki like files into html and LaTeX, combining easy wiki syntax, your preferred non-web text editor and svn/cvs control to write static webs, cientific articles or even blogs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Keep company data safe with Chrome Enterprise Icon
    Keep company data safe with Chrome Enterprise

    Protect your business with AI policies and data loss prevention in the browser

    Make AI work your way with Chrome Enterprise. Block unapproved sites and set custom data controls that align with your company's policies.
    Download Chrome
  • 5

    LIstFOrmatCONverter

    A tool to convert an exported text file from one format to another.

    Sometimes an upgraded application changes data formats which can break compatibility with previous versions. If import/export text files are of the following format: "descript1","descript2","descript3" "data1","data2","data3" "data1","data2","data3" then this program can rearrange the data of large exported files in order to be imported into another version or application with little effort.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6

    HTML DOM Parser

    HTML parser which can be used for screen-scraping applications

    htmldom parses the HTML file and provides methods for iterating and searching the parse tree in a similar way as Jquery. To report bugs please mail me at bhimsen.pes@gmail.com
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Hylt is a file format that provides HYperLinked Text with minimal markup, meant to be both easy to view in any program and easy to keep in a version control system. The Hylt project covers the file specification, viewers, and tools to manage Hylt files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    PROJECT HAS MOVED: https://github.com/wiki2beamer
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9

    ExiProcessor

    Command-line program for processing Efficient XML Interchange (EXI)

    ExiProcessor is a command-line program that encodes text XML files into binary EXI and decodes EXI files into XML. It uses the open source Java-based library EXIficient (http://exificient.sourceforge.net) as the EXI parser. In essence, ExiProcessor is a command-line interface to EXIficient. ExiProcessor can help people learn about the various EXI encoding and decoding options and how those options affect compression ratios.
    Downloads: 2 This Week
    Last Update:
    See Project
  • Build Securely on Azure with Proven Frameworks Icon
    Build Securely on Azure with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 10
    This is a Data Serialization Language intended to represent a hierarchial data as a text (of ANY charset). ANY "binary" or "textual" data may be represented or marked-up as well. Extremely low redundance. Very fast parser (is similar to SAX).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Html Assembler
    Html Assembler is a static site generator. It automatically integrates page content such as text and photos in a modifiable page template creating a complete set of html files ready for upload to your site.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    A set of Unix command line tools for quick and convenient batch processing of tabular text files (a.k.a., tab-delimited, csv, or flat file format) with a header line. Provides delimiter and compression detection, column reference by name. * tblmap: per-line ("map") computation: derive columns through an expression, delete, reorder, filter rows. * tblred: compute ("reduce") aggregations (e.g., sum, average) over groups defined by key columns. * tbldesc: Summarize columns in file...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    X2S

    A simple DOM parser in Java

    X2S is a very simple DOM parser. The aim is to retrieve attributes and text-values from an XML DOM document in the case that you don't need all features and complexity of XPath and/or XQuery. For example, if it could be useful a method to retrieve all text-values from a DOM relative path like: /root-node/persons/person/name or the attributes from a string like: /root-node/persons/person/id
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    knowceans

    Utility classes from maps to search engine to random samplers

    .... --- Highlights: --- org.knowceans.util: IndexQuickSort, TableList: apply order of one array/list to others +++ Vectors, ArrayUtils: array convenience +++ RandomSamplers, CokusRandom, ArmSampler, Densities: random sampling and distributions +++ Arguments: command line parser +++ StopWatch, Which, ExternalProcess: runtime stuff +++ ParallelFor: OpenMP workalike +++ PatternString, NamedGroupRegex: regex convenience --- org.knowceans.corpus: CorpusSearcher: full-text search engine +++ LabelNumCorpus: svmlight corpus storage and filtering +++ NIPS corpus with text, authors, labels and citations --- org.knowceans.map: InvertibleHashMultiMap, BijectiveHashMap: implement n:m and 1:1 relations. --- Other libs: knowceans-arms = port of the Adaptive Rejection Metropolis Sampler (ARMS) for arbitrary distributions +++ lda-j = port of lda-c, implementing Latent Dirichlet Allocation (LDA)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Picxml is a tool for xml files management. Used as command: Picxml is a shell friendly, xpath like query tool. Used as library: picxml, is a full featured xml Parser, wich allows you to navigate, find/create/update/delete/print nodes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    SamChanEd
    SamChanEd is a command line tool to organize channels list on Samsung TV. Currently it supports only analog channels on C series of TV sets. TV icon by http://cemagraphics.deviantart.com/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Converts a wide variety of different character encodings in text files to unicode. Batch mode to make converting many documents at once possible.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Multiple format bibliography processor in Python.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    creole/c is a Wiki Creole parser and a HTML converter. It implements Wiki Creole 1.0 and almost all of its additions. The parser is written in C++ and has a simple event driven plain C API. The converter is a stand-alone console application.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    This is a pure-Python XPath evaluator based on ElementTree. It supports a substantial fraction of the XPath 1.0 specification, but only the self, child, and attribute axes. The parser underlying the evaluator attempts to handle all of XPath 1.0.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    A parser for patch (diff) files written as a Python module around the GNU patch code.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    A python package for interpreting METAR coded weather reports
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Advanced PHP library to pase xml documents to array with many advanced options.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    A program to assist in writing Arabic text in programs that don't support Arabic, such as Photoshop (the non-ME editions), Scribus, Flash, and others. Source code is hosted on github: http://github.com/hasenj/arabic-writer
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    A JavaScript library for parsing Creole 1.0 wiki markup.
    Downloads: 0 This Week
    Last Update:
    See Project