Showing 77 open source projects for "python text parser"

View related business solutions
  • Keep company data safe with Chrome Enterprise Icon
    Keep company data safe with Chrome Enterprise

    Protect your business with AI policies and data loss prevention in the browser

    Make AI work your way with Chrome Enterprise. Block unapproved sites and set custom data controls that align with your company's policies.
    Download Chrome
  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    The database for AI-powered applications.

    MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.
    Start Free
  • 1
    chemfiles

    chemfiles

    Modern library for chemistry file reading and writing

    Chemfiles is a modern and high-quality library for reading and writing trajectory files created by computational chemistry simulations program. These trajectories contains atomic positions, velocities, names, topology and sometimes more. Running simulations produce enormous amounts of data, which has to be post-processed to extract physical information about the simulated system. Chemfiles provides an interface to access this information which is - unified: the same code will work with...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Python in neuroscience

    Python in neuroscience

    Collection of tutorials about methods of computational neuroscience

    "Python in neuroscience" is a collection of collaboratively edited tutorials on computational neuroscience methods using Python language. Each tutorial is presented as a literate programming tutorial mixing text, code, and figures. The tutorials offer an introduction to basic methods of computational neuroscience and scientific programming in Python.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3
    aeneas

    aeneas

    Automagically synchronize audio and text (aka forced alignment)

    aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment). aeneas automatically generates a synchronization map between a list of text fragments and an audio file containing the narration of the text. In computer science this task is known as (automatically computing a) forced alignment.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 4

    TFieldedText

    Fielded Text (CSV) file parser/generator

    TFieldedText is a component which allows you to easily: generate and parse Fielded Text Files (eg. CSV files); and create and edit Fielded Text Meta files. For more information about the Fielded Text standard see http://www.fieldedtext.org
    Downloads: 1 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 5

    deemon

    Deemon Scripting Language

    A rich and powerful scripting language supporting the c preprocessor, classes, exceptions, lambdas, yield functions, label, switch-statement, pointers, foreign functions, and much more. With a fully packed standard library including support for file-io, threads, atomics, pipes, math, file-system, sockets, randomization, hashing, serialization, a wide selection of emulated c/c++ headers and more, deemon provides a suitable environment for any application. Syntax is easy to read and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Question Answering Corpus

    Question Answering Corpus

    Question answering dataset in "Teaching Machines to Read & Comprehend"

    RC-Data is a dataset generation framework created by Google DeepMind to produce large-scale reading comprehension question-answer pairs from CNN and Daily Mail news articles. The dataset, introduced in the 2015 paper “Teaching Machines to Read and Comprehend” (Hermann et al., NIPS 2015), was among the first large corpora designed to train and evaluate machine reading and comprehension models. The repository provides scripts for downloading archived CNN and Daily Mail articles from the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Knwl.js

    Knwl.js

    A .js library for parsing text for specific information

    Knwl.js is a Javascript library that parses through text for dates, times, phone numbers, emails, places, and more. Parser plugins are what make Knwl.js tick and give it new parsing abilities. There are many already available under ./default_plugins for a number of tasks. If you're interested in developing plugins for Knwl.js, ./plugin_development.md is a great place to start. You can optionally specify a language in the first parameter.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Gumbo

    Gumbo

    An HTML5 parsing library in pure C99

    Gumbo is an implementation of the HTML5 parsing algorithm implemented as a pure C99 library with no outside dependencies. It's designed to serve as a building block for other tools and libraries such as linters, validators, templating languages, and refactoring and analysis tools. Gumbo gains some of this by virtue of being written in C, but it is not an important consideration for the intended use-case, and was not a major design factor. Gumbo is intentionally designed to turn an HTML...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9

    HTML XHTML Parser + XPath

    Delphi HTML XHTML Parser +XPath

    Delphi HTML Parser This module lets you work with HTML documents as DOM tree and use XPath for searching tags. It is very simple way to parse HTML. This tested with version Delphi XE5,6 Usage Add in Uses parser.pas; begin HtmlTxt:= ''; //here your html NodeList:= TNodeList.Create; ValueList:= TStringList.Create; DomTree:= TDomTree.Create; DomTreeNode:= DomTree.RootNode; If DomTreeNode.RunParse(HtmlTxt) then begin {your code example: DomTreeNode.FindXPath('//*[@id="TopBox"]/div[1]/div[@class="draw default"]'),NodeList,ValueList)} end; end; Xpath support: attributes - //*[@id="TopBox"]/div/@class comment - //*[@id="TopBox"]/div/comment()[3] text - //*[@id="TopBox"]/div/text()[2] previous level - /.....
    Downloads: 14 This Week
    Last Update:
    See Project
  • Failed Payment Recovery for Subscription Businesses Icon
    Failed Payment Recovery for Subscription Businesses

    For subscription companies searching for a failed payment recovery solution to grow revenue, and retain customers.

    FlexPay’s innovative platform uses multiple technologies to achieve the highest number of retained customers, resulting in reduced involuntary churn, longer life span after recovery, and higher revenue. Leading brands like LegalZoom, Hooked on Phonics, and ClinicSense trust FlexPay to recover failed payments, reduce churn, and increase customer lifetime value.
    Learn More
  • 10
    IronMeta is an implementation of Alessandro Warth's OMeta metaprogramming system in C#. It provides a packrat parser generator that generates parsers for Parsing Expression Grammars that operate on arbitrary streams of objects.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Python Tkinter Extensions

    Python Tkinter Extensions

    Additional Graphical Classes and Widgets for Tkinter/Ttk

    This is a Python package with extensions for Tkinter. It supports both Python 2.7 and Python 3.3. Currently, the project is in version 1.0 of the Production stage. Feel free to fork off the main project or send me possible additions to the package.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12

    Javascript Moony Parser

    library: parsing text by given grammar in javascript

    This is a javascript library for parsing texts. The library accepts minimalistic moony grammar language and parses given text by the grammar. It is an implementation of "Earley" parser which makes it context free grammar complete. Instructions are bundled in download.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    TextBlob

    TextBlob

    TextBlob is a Python library for processing textual data

    Simple, Pythonic, text processing, Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more. It provides a simple API for diving into common natural language processing (NLP) tasks such as part-of-speech tagging, noun phrase extraction, sentiment analysis, classification, translation, and more. TextBlob stands on the giant shoulders of NLTK and pattern, and plays nicely with both. Supports word inflection (pluralization and singularization) and lemmatization,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Voikko

    Voikko

    Library of linguistic tools

    Voikko is a spell checking, grammar checking, morphological analysis and hyphenation system. Spell checkers are available for multiple languages, other features for Finnish only.
    Leader badge
    Downloads: 17 This Week
    Last Update:
    See Project
  • 15
    Stand-alone Java library implementing parser/formatter/comparator/validator for JSON/XML-like text formats oriented on JSON-like object model (list,map,scalar + reflection). Library is designed to maximize adaptivity via set of extendable modules.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    HTML DOM Parser

    HTML parser which can be used for screen-scraping applications

    htmldom parses the HTML file and provides methods for iterating and searching the parse tree in a similar way as Jquery. To report bugs please mail me at bhimsen.pes@gmail.com
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    MongoLantern - MongoDB Fulltext Search

    MongoLantern - MongoDB Fulltext Search

    Open Source MongoDB Fulltext Search Server

    MongoLantern is an open source full text search server using MongoDB as index storage, which allows MongoLantern to migrate any changes very easily into account using MongoDB API. It's written originally written in PHP can be migrated to any desired language as required using it's future APIs. MongoLantern 0.7 - Stable/Production Release: 1. MongoLantern API support enabled. 2. CSV indexer added as a plugin. 3. node.js API client added.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    X2S

    A simple DOM parser in Java

    X2S is a very simple DOM parser. The aim is to retrieve attributes and text-values from an XML DOM document in the case that you don't need all features and complexity of XPath and/or XQuery. For example, if it could be useful a method to retrieve all text-values from a DOM relative path like: /root-node/persons/person/name or the attributes from a string like: /root-node/persons/person/id
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19

    knowceans

    Utility classes from maps to search engine to random samplers

    .... --- Highlights: --- org.knowceans.util: IndexQuickSort, TableList: apply order of one array/list to others +++ Vectors, ArrayUtils: array convenience +++ RandomSamplers, CokusRandom, ArmSampler, Densities: random sampling and distributions +++ Arguments: command line parser +++ StopWatch, Which, ExternalProcess: runtime stuff +++ ParallelFor: OpenMP workalike +++ PatternString, NamedGroupRegex: regex convenience --- org.knowceans.corpus: CorpusSearcher: full-text search engine +++ LabelNumCorpus: svmlight corpus storage and filtering +++ NIPS corpus with text, authors, labels and citations --- org.knowceans.map: InvertibleHashMultiMap, BijectiveHashMap: implement n:m and 1:1 relations. --- Other libs: knowceans-arms = port of the Adaptive Rejection Metropolis Sampler (ARMS) for arbitrary distributions +++ lda-j = port of lda-c, implementing Latent Dirichlet Allocation (LDA)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Parser Delphi class. Extendable parser that processes numbers (int/hex/float), text constants in quotation marks (with '' \' \n, \t, \u0000, etc), brackets, lists (comma separated elements), skips whitespaces and comments like //, /* */
    Downloads: 2 This Week
    Last Update:
    See Project
  • 21
    A collection of open source libraries and tools that provide solutions for common problems in processing Arabic text, especially in web applications. text normalization, phrase segmentation, text indexing, stop word lists, common spelling mistakes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    C# .NET library implementing the Pop3 message retrieval protocol
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Python API for accessing 4chan image boards. Provides posting, reading, and tripcoding capabilities.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    CTypes-based python bindings to the GSASL library (www.gnu.org/software/gsasl ). This package provides a parser that generates python code w/ types checking and more pythonish feel than just plain usage of the library through ctypes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Svg parser is a C++ library, that you can use to parse SVG documents to build object model, you can work with. It's very small, so you don't need much time to build it. You can easily integrate it to your applications because of some accessibilities.
    Downloads: 0 This Week
    Last Update:
    See Project