Showing 133 open source projects for "python text parser"

View related business solutions
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 1
    OOoPy is a library in Python for inspecting, creating or modifying OpenOffice.org documents. It uses the existing ElementTree XML library by Fredrik Lundh for manipulation of the OOo XML.
    Leader badge
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2

    SEO Tool

    Seo Tool: AI Autoblogger for ArtikelSchreiber.com and UNAIQUE.net

    Seo Tool: AI Autoblogger for https://www.artikelschreiber.com/ and https://www.unaique.net/ Function: Create Mini site based on Config File on your server https://linktr.ee/textgenerator Use a Shared Hosting Server Change the "seo-marketing-tool.conf" config to fit your needs (eg: Create a Mini Site about "Cars" then change the AI Software API Keywords to "car") Start the Tool "python3 seo-marketing-tool.py" Your mini site will be created on your Shared Hosting Server HTML5...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    googler

    googler

    Google from the terminal

    googler is a power tool to Google (web, news, videos and site search) from the command line. It shows the title, URL and abstract for each result, which can be directly opened in a browser from the terminal. Results are fetched in pages (with page navigation). Supports sequential searches in a single googler instance. googler was initially written to cater to headless servers without X. You can integrate it with a text-based browser. However, it has grown into a very handy and flexible...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    QPyDesk

    QPyDesk

    Code editor and real-time QR code generator for QPython

    QPyDesk is a code editor, and real-time QR code generator for QPython. It is a Python code editor with syntax highlighting that also generates the QR code that represents said code in real time. This application also allows you to print the generated QR code to distribute the created application. However, because QR codes have a limited storage capacity, if the code is very long, QPyDesk creates a QR code that is only valid while the application is running, that is, the QR code generated...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 5
    googler

    googler

    Google Search, Google Site Search, Google News from the terminal

    googler is a power tool to Google (Web & News) and Google Site Search from the command-line. It shows the title, URL and abstract for each result, which can be directly opened in a browser from the terminal. Results are fetched in pages (with page navigation). Supports sequential searches in a single googler instance. googler was initially written to cater to headless servers without X. You can integrate it with a text-based browser. However, it has grown into a very handy and flexible...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    yabasta

    yabasta

    Yet Another BAsic Scraper and Text Analysis

    YA BASTA! is a Python/R application for Lyrics Web Scraper and Text Analysis. Web scraping is developed in Python, text analysis in R as Python subprocesses. YA BASTA! is only tested on windows OS. To run YA BASTA! just type on window command prompt: python.exe yabasta.py
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    SFM2Web reads text and database files encoded with SFMs (Standard Format Markers) and then generates a web site according to flags specified in control files. This is useful for web publication of MDF lexicons, USFM Bible books, texts, phrasebooks, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Backuso

    Backuso

    Backup software for your data to local and remote SSH systems

    ...A remote storage is anything which is accessed on your network via the ssh protocol. This could be a NAS (Network Attached Storage) in your local LAN, or a server share on the internet. Backuso is configured with a simple text file. You can start the backups manually or automatically, e.g. using crontab. Backuso is based on Python 3 and uses the software rsync as its core element. Hence it runs on systems where rsync is available (Linux, FreeBSD, Apple).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    mzitu

    mzitu

    Python crawler that downloads image galleries and analyzes titles

    ...Using text segmentation and frequency analysis, the project can create a word cloud representing common keywords found in the dataset. This makes the repository both a scraping example and a small data analysis experiment built around the collected content. Overall, mzitu serves as a learning-oriented implementation of Python web scraping, data processing, and visualization techniques.
    Downloads: 6 This Week
    Last Update:
    See Project
  • Streamline Azure Security with Palo Alto Networks VM-Series Icon
    Streamline Azure Security with Palo Alto Networks VM-Series

    Centrally manage physical and virtualized firewalls with Panorama

    Improve your security posture and reduce incident response time. Use the VM-Series to natively analyze Azure traffic and dynamically drive policy updates based on workload changes.
    Learn more
  • 10
    uncaptcha

    uncaptcha

    Defeating Google's audio reCaptcha with 85% accuracy

    uncaptcha is an open-source proof-of-concept system designed to demonstrate vulnerabilities in Google’s audio reCAPTCHA challenges by automatically solving them using speech recognition techniques. The project uses browser automation to navigate to CAPTCHA challenges, extract audio files, and process them through multiple speech-to-text services. By combining outputs from several transcription engines, the system increases the likelihood of correctly identifying the spoken digits or phrases...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Microblog

    Microblog

    Microblogging application developed in my Flask Mega-Tutorial series

    Microblog is a Flask-based social blogging application that serves as the companion example project to the author’s Flask tutorial book, demonstrating how to build a production-ready web application with features typical of modern social systems. The project includes user registration and login, password hashing, profile editing, and avatar support, but it goes deeper by illustrating how to implement following/follower relationships, posting short messages, and displaying personalized feeds....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    DSTK - DataScience ToolKit

    DSTK - DataScience ToolKit

    DSTK - DataScience ToolKit for All of Us

    DSTK - DataScience ToolKit is an opensource free software for statistical analysis, data visualization, text analysis, and predictive analytics. Newer version and smaller file size can be found at: https://sourceforge.net/projects/dstk3/ It is designed to be straight forward and easy to use, and familar to SPSS user. While JASP offers more statistical features, DSTK tends to be a broad solution workbench, including text analysis and predictive analytics features.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    htmlarea

    htmlarea

    Small, powerful, full featured WYSIWYG editor

    HTMLArea 4 is a browser based WYSIWYG editor that easily replaces the TEXTAREA in your web pages. It is written in JavaScript, and suitable for use in any modern web browser, and any page on your web site. Current version is 4.0-2016-08-29
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Bifrozt

    Bifrozt

    High interaction honeypot solution for Linux based systems

    NOTICE: The format of this project has been changed from ISO to using ansible and has been moved to GitHub. Github link: https://github.com/Bifrozt/bifrozt-ansible
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    That project aims at providing a clean API, and the corresponding C++ implementation, for parsing travel-focused requests (e.g., "washington dc beijing monday r/t +aa -ua 1 week 2 adults 1 dog").
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    pyFreeDnsUpdate

    FreeDNS update python script

    pyFreeDnsUpdate is a command line python script that updates the FreeDNS record for a given domain to the current public facing IP address detected by the local host. It requires a one time edit to configure a list of domain|api_key pairs, and after that it can be run as a chron job to perform a sync check and update push if needed. Activity is logged on a local text file, and DEBUG level can be enabled in case of troubles.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    huhamhire-hosts

    huhamhire-hosts

    hosts for Internet Freedom

    Since the governments of some countries are using the blocking the internet access to several websites and web service providers which includes some world famous sites like Google, YouTube, twitter, Facebook, and Wikipedia etc., we designed this tiny utility in order to help people getting through the Internet blockade. Hosts Setup Utility provides basic tools to manage the hosts file on current operating systems. It also provides both support for Graphical Desktop environment with...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 18

    sitecheck

    Modular web site spider for web developers.

    More than just a link checker, sitecheck is a website spider (also known as a crawler) which can assist with SEO by testing an entire site plus both inbound links from search engines and outbound links to other sites for the following issues: looping redirects (HTTP 301/302), broken links (HTTP 404), server errors (HTTP 500), spelling mistakes, low readability scores (using the Flesch Reading Ease test), missing/empty/duplicate meta tags, duplicate content, slow page speed, W3C validation...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    SE Auditor

    Free SEO audit software.

    SE Auditor is a program for analyzing web pages for search engines. SE Auditor is application that you can use to view statistical data about your website, in order to improve its position within the Web search results. SE Auditor is addressed to SEO professionals, website designers, developers, website testers and owners. SE Auditor enables you to check meta description, keywords, sitemap, the number of links and keyword consistency, the text/HTML ratio and many more ranking /...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Barq Download Manager
    Barq Download Manager is a GUI for aria2 download manager written in PySide
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    PynDora

    PynDora

    Python WebServer Log File Analyzer

    This is a web log file analyzer we are making using python. First the IIS parsing engine wil be built and then Apache and possibly other servers. It is going to support multiple log files from any date and output the statistics in html formatted files, incorporating automatically build charts. It will be a pure python solution which is going to be self contained, ie no installation will be required other from the standard python modules.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    URL Grabber

    Grab the all links in a page.

    URL Grabber is a windows tool used to capture all the links in the website or webpage. And Store it in a text file.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    Html SymboliZe

    Html SymboliZe

    transcodes between html entities and regular text

    Hsz takes the text you type and turns it into the proper html entities. Hsz is designed to make web developing easier by providing an easy means of looking up html entity codes. (see http://www.w3schools.com/html/html_entities.asp for info about what html entity codes are)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    the maker - Content Management System
    "the maker" is a Content Management System for websites. It is client side and interacts seamlessly with a remote FTP server. It features a powerful text editor, has a build in FTP client and a tool for basic image manipultaion.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Haproxy Time

    Haproxy Time

    Haproxy connection time log analyser

    A log parser for Haproxy that provide connection time statistics. Haproxy log parser Embeded email reporting
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB