Showing 17 open source projects for "text parser"

  • 1
    tika-python

    Python binding to the Apache Tika™ REST services

    A Python port of the Apache Tika library that makes Tika available through the Tika REST Server. The library is installable via Setuptools or pip. To use it, you need Java 7+ installed on your system, because tika-python starts the Tika REST server in the background. To get this working in a disconnected environment, download a tika server file (both tika-server.jar and tika-server.jar.md5, which can be found here) and set...
    Downloads: 0 This Week
  • 2
    LlamaParse

    Parse files for optimal RAG

    LlamaParse is a GenAI-native document parser that can parse complex document data for any downstream LLM use case (RAG, agents). Load 160+ data sources and data formats, from unstructured and semi-structured to structured data (APIs, PDFs, documents, SQL, etc.). Store and index your data for different use cases. Integrate with 40+ vector stores, document stores, graph stores, and SQL database providers.
    Downloads: 0 This Week
  • 3
    Walrus

    Lightweight Python utilities for working with Redis

    ..., ranges, less/greater-than, and a basic full-text search index. The full-text search features a boolean search query parser, porter stemmer, stop-word filtering, and optional double-metaphone implementation.
    Downloads: 0 This Week
  • 4
    pdf-editor

    Edit your PDFs without needing a subscription or creating accounts

    Edit your PDFs without needing a subscription or creating accounts. Add a GUI/Turn it into a web application. Add a parser for the command line to do multiple commands at once e.g. merge (cut pdf1) pdf2. Tested working with Python 3.8.5. Install venv (py -3.8 -m pip install virtualenv). PDF and Word documents are binary files, which makes them much more complex than plaintext files. In addition to text, they store lots of font, color, and layout information. If you want your programs to read...
    Downloads: 7 This Week
  • 5

    psgdump

    Dump PSG/YM chiptune files to text and MIDI format

    PSGDump is a parser and converter for chiptune files. It supports the PSG and YM input file formats, focusing on AY/YM chiptunes from the ZX Spectrum and Atari ST. The tool produces text output of the notes played and creates a multi-track MIDI file.
    Downloads: 0 This Week
  • 6
    Flick App Development

    Have you ever just wanted to code an app more easily?

    Code an app more easily with Flick, a Python parser that can make your console apps easier to write. Although it has only one type of snippet, it is at its best when you run it to make stories, text movies, whatever! Here's a line from the creator, mainly to a blog: "I'm going to make variable-based objects soon, because you can't really modify objects."
    Downloads: 0 This Week
  • 7
    pylatexenc

    Simple LaTeX parser providing latex-to-unicode and unicode-to-latex

    Simple LaTeX parser providing latex-to-unicode and unicode-to-latex conversion. Python 3.4 or 2.7. The library is designed to be as backward-compatible as reasonably possible and can run on old Python versions should it be necessary. (Use the setup.py script directly if you have Python 3.7; poetry doesn't seem to work with old Python versions.) The pylatexenc.latexencode module provides a function unicode_to_latex() which converts a Unicode string into LaTeX text and escape sequences...
    Downloads: 0 This Week
  • 8
    Bangla TTS

    Bangla text to speech synthesis in python

    Bangla text to speech. Multilingual (Bangla, English) real-time ([almost] in a GPU) speech synthesis library.

    Installation
      * Install Anaconda
      * conda create -n new_virtual_env python==3.6.8
      * conda activate new_virtual_env
      * pip install -r requirements.txt
      * While running for the first time, keep your internet connection on to download the weights of the speech synthesis models (>500 MB)
      * For fast...
    Downloads: 1 This Week
  • 9
    PangView

    Text viewer that displays text similar to Pango

    A text viewer utility that displays text in a Pango-like markup. The utility is written in Python and uses the PyGTK toolkit. The parser reads Pango tags, which are similar to HTML. The utility was written so that formatted text can be displayed quickly. NEW! The parser now uses a faster internal data structure.
    Downloads: 0 This Week
  • 10

    Nehul Singh IDE for .Net

    NS-ORG IDE for .NET programming

    Also visit the official website nehulsinghide.blogspot.in. Write applications in languages including C#, VB.NET, F#, IronPython and IronRuby, and target rich and reach: Windows Forms or WPF, as well as ASP.NET MVC and WCF. It starts from USB drives, supports read-only projects, and comes with integrated unit and performance testing tools, Git, NuGet and a lot more features that make you productive as a developer. It ships with lots of new features for code completion and snippet...
    Downloads: 0 This Week
  • 11

    t2t-pipe

    Automatic alignment pipeline for parallel treebanks

    The *Tree-to-Tree (t2t) Alignment Pipe* is a collection of Python scripts that coordinates the automatic alignment of parallel treebanks from plain text files with a single call from a Unix command line. Supported languages: DE, FR, EN
    Downloads: 0 This Week
  • 12
    Sentence Parser for Python
    This is code for sentence parsing that does its job properly and FAST. The main problem is that you really need a database of abbreviations so that phrases such as "Dr. Smith" are not counted as two sentences, which means that a good parser must be language-dependent. I am also providing a list of all English abbreviations with the code. You can always tweak the code to get nicer output, but the main idea is there and is fully implemented in this little program.
    Downloads: 0 This Week
  • 13
    Java exception extractor. This utility parses all files (either plain text or bzipped) and searches for various exceptions. It then tries to match the exceptions against grouping rules (regexps). It can also group unrecognised exceptions.
    Downloads: 0 This Week
  • 14
    A collection of Python scripts to create and handle an XML corpus (a large collection of text for linguistic purposes) from an original Wikipedia database backup dump. It includes a regular-expression-based parser for the MediaWiki markup language.
    Downloads: 0 This Week
  • 15
    PyINI is a cross-platform INI file parser library that can read, edit and create INI files. INI is a plain text format for storing program profiles. PyINI is written in Python, and a C/C++ binding (with other languages planned for the future) is also available.
    Downloads: 0 This Week
  • 16
    PyRtfLib is a Python library that provides a parser and a few translators, such as RTF to HTML and RTF to plain text.
    Downloads: 5 This Week
  • 17
    PM2HTML takes PageMaker files and makes a cohesive newspaper website. It comprises a PMScript that exports all stories to a directory of tagged text files, and a Python program to act as a converter to turn those tagged text files into HTML, a parser to guess
    Downloads: 0 This Week
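
The "Sentence Parser for Python" entry above notes that a sentence splitter needs a database of abbreviations so that "Dr. Smith" is not counted as two sentences. The following is a minimal sketch of that idea, not the project's actual code: the ABBREVIATIONS set and the split_sentences function are hypothetical names chosen for illustration, and a real list would be far larger and specific to one language.

```python
import re

# Hypothetical, deliberately tiny abbreviation list (assumption for this sketch).
# Entries are lowercase and written without the trailing period.
ABBREVIATIONS = {"dr", "mr", "mrs", "ms", "prof", "e.g", "i.e"}


def split_sentences(text):
    """Split text at '.', '!' or '?' followed by whitespace,
    skipping periods that end a known abbreviation."""
    sentences = []
    start = 0
    for match in re.finditer(r"[.!?]\s+", text):
        # Word immediately before the punctuation mark.
        words = text[start:match.start()].split()
        last_word = words[-1].lower() if words else ""
        if text[match.start()] == "." and last_word in ABBREVIATIONS:
            continue  # the period belongs to an abbreviation, not a sentence end
        sentences.append(text[start:match.start() + 1].strip())
        start = match.end()
    # Whatever remains after the last boundary is the final sentence.
    tail = text[start:].strip()
    if tail:
        sentences.append(tail)
    return sentences


if __name__ == "__main__":
    for sentence in split_sentences("Dr. Smith arrived. He was late."):
        print(sentence)
```

A known limitation of this sketch is that an abbreviation that really does end a sentence is not treated as a boundary, which is one reason the entry calls a good parser language-dependent.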