Search Results for "html source extractor" - Page 5

Showing 518 open source projects for "html source extractor"

View related business solutions
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 1
    LangChain Extract

    LangChain Extract

    Did you say you like data?

    LangChain Extract is an open-source reference application designed to demonstrate how large language models can be used to extract structured data from unstructured text and document files. The project implements a lightweight web service that allows developers to define extraction schemas and apply them to various sources such as plain text, HTML, or PDF documents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    pypandoc

    pypandoc

    Thin wrapper for "pandoc" (MIT)

    Pypandoc provides a thin wrapper for pandoc, a universal document converter. Pypandoc uses pandoc, so it needs an available installation of pandoc. Pypandoc provides 2 packages, "pypandoc" and "pypandoc_binary", with the second one including pandoc out of the box. The 2 packages are identical, with the only difference being that one includes pandoc, while the other don't. If pandoc is already installed (i.e. pandoc is in the PATH), pypandoc uses the version with the higher version number,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    bridgex

    bridgex

    Convert files like docx, xlsx, pptx, html, and more to MarkDown

    ...Supported Formats 📂 Bridgex supports conversion of the following file formats: - PDF (.pdf) - Word (.docx) - PowerPoint (.pptx) - Excel (.xlsx, .xls, .csv) - Outlook Messages (.msg) - Text (.txt, .text) - Markdown (.md, .markdown) - JSON (.json, .jsonl) - XML (.xml) - RSS/Atom (.rss, .atom) - HTML/MHTML (.html, .htm, .mhtml) - ePub (.epub) - Compressed files (.zip) - Jupyter Notebooks (.ipynb) - Other formats supported by Markitdown Bridgex is not an IDE, text editor, Markdown editor, or document viewer
    Downloads: 6 This Week
    Last Update:
    See Project
  • 4
    Whakerexa

    Whakerexa

    A minimalist and lightweight web kit for accessible contents

    `Whakerexa` provides a lightweight, modular set of CSS and JavaScript tools for building accessible, consistent, and customizable web interfaces. It is intended to be as simple as possible to make **accessible web content**, and to minimize the use of CSS classes for enhancing the readability of HTML code. It was designed to be easily customizable, allowing users to adjust properties such as fonts, colors, borders, etc., effortlessly. Most of the properties are stored into variables which makes possible to re-define them, then to obtain a custom different style, enabling users to achieve a unique style easily. It can be combined with the use of WhakerPy, an open source library to create dynamic HTML content: <https://whakerpy.sf.net>. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 5
    WhakerKit

    WhakerKit

    A seamless toolkit to manage dynamic websites and shared documents

    WhakerKit is a versatile toolkit for building websites with both static and dynamic HTML pages, developed by Brigitte Bigi, CNRS. WhakerKit offers seamless management of public and authenticated access, and simplifies document sharing for collaborative environments. It is based on the following technologies: * python >= 3.9 * (optional) PyJWT and ldap3 for authentication (install with pip) * WhakerPy >= 1.3: <https://whakerpy.sourceforge.io> (install with pip) * Whakerexa >=...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    TextTest is an application-independent tool for text-based functional testing. This means running a batch-mode binary in lots of different ways, and using the text output produced as a means of controlling the behaviour of that application.
    Leader badge
    Downloads: 85 This Week
    Last Update:
    See Project
  • 7
    Mastering Bitcoin

    Mastering Bitcoin

    Mastering Bitcoin 3rd Edition - Programming the Open Blockchain

    The bitcoinbook repository contains the source code for Mastering Bitcoin, the authoritative open-source book by Andreas M. Antonopoulos on Bitcoin and cryptocurrency technologies. Written in a collaborative and continuously updated format using Markdown and AsciiDoc, the book serves as a comprehensive technical guide for developers, engineers, and system architects who want to understand how Bitcoin works. It covers the protocol, cryptography, peer-to-peer architecture, wallets, mining, and...
    Downloads: 11 This Week
    Last Update:
    See Project
  • 8
    Wapiti

    Wapiti

    Wapiti is a web-application vulnerability scanner

    Wapiti is a vulnerability scanner for web applications. It currently search vulnerabilities like XSS, SQL and XPath injections, file inclusions, command execution, XXE injections, CRLF injections, Server Side Request Forgery, Open Redirects... It use the Python 3 programming language.
    Leader badge
    Downloads: 133 This Week
    Last Update:
    See Project
  • 9
    csv2odf

    csv2odf

    csv2odf can convert csv data to formatted spreadsheets and documents.

    ...It can be used to create spreadsheets and documents for LibreOffice, OpenOffice, Microsoft Office Excel and Word. It is open source GPL v3 and crossplatform, it can run on most operating systems that can run Python (Python is required). More details, example files, and online manual at http://csv2odf.sf.net.
    Downloads: 10 This Week
    Last Update:
    See Project
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • 10

    cfe

    Framework for producing web-based database applications

    Sophisticated environment for developing web-based applications. It includes lots of handy low-level components including: 1/ mySql interface, including table creation, tables as class, query as iterator 2/ html generation 3/ multi-lingual ability 4/ html form field handling 5/ etc. On top of this is a high-level framework that allows rapid development of database apps including seach/create/display/update, user navigation between related entities and much more. Included, as a...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    Small Python library with various things such as Configuration file parsing (in Python syntax), HTML and PDF parsing. Used in others of my projects.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    LexiFinder

    LexiFinder

    AI-powered semantic indexing: automating the creation of book indexes

    ...Both interfaces share the same underlying engine and support the same features. Supported input formats are PDF, DOCX, and ODT. The index can be exported as plain text, JSON, CSV, or HTML.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 13
    Forex Assistant

    Forex Assistant

    Manage accounts, analyze trades, track strategies & stats , News & Cal

    Forex Assistant is a comprehensive Windows desktop application designed to support forex traders in every aspect of their trading journey. From essential trading calculators and market insights to powerful account and trade management, it brings everything you need into one intuitive platform. 🔑 Main Features 🧮 Calculator Tools Profit/Loss Calculator Currency Converter Position Size Calculator Pips Calculator Margin Calculator 🌍 Market Insights Live Market News Economic...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 14
    justniffer
    justniffer is a TCP sniffer. It reassembles and reorders packets and displays the tcp flow in a customizable way. It can log network traffic in web server log format. It can also log network services performances (e.g. web server response times) and extract http content (images, html, scripts, etc)
    Downloads: 2 This Week
    Last Update:
    See Project
  • 15
    Buku

    Buku

    Powerful command-line bookmark manager. Your mini web!

    buku is a powerful bookmark manager written in Python3 and SQLite3. buku fetches the title of a bookmarked web page and stores it along with any additional comments and tags. You can use your favourite editor to compose and update bookmarks. With multiple search options, including regex and a deep scan mode (particularly for URLs), it can find any bookmark instantly. Multiple search results can be opened in the browser at once. Though a terminal utility, it's possible to add bookmarks...
    Downloads: 15 This Week
    Last Update:
    See Project
  • 16
    Libro

    Libro

    An interactive program for statistical analysis of texts

    A cross-platform text analysis program written in Python and Free Pascal/Lazarus which scans a whole text file (in plain text, HTML, EPUB, or ODT formats) and ranks all used words according to frequency, performing a quantitative analysis of the text using Shannon-Weaver information statistic and Zipf power law function. It counts words, sentences, chars, spaces, and syllables. Also computes readability indexes (Gunning-Fog, Coleman-Liau, Automated Readability Index (ARI), SMOG grade,...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 17
    AI Bulk Mailer

    AI Bulk Mailer

    Phần mềm gửi email hàng loạt với tạo nội dung bằng AI

    Phần mềm gửi email hàng loạt với khả năng hỗ trợ đa luồng, tạo nội dung tự động bằng nhiều mô hình AI và thu thập tất cả email trên một trang web.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 18
    ufonet

    ufonet

    UFONet - Denial of Service Toolkit

    UFONet - Is a set of hacktivist tools that allow launching coordinated DDoS and DoS attacks and combine both in a single offensive. It also works as an encrypted DarkNET to publish and receive content by creating a global client/server network based on a direct-connect P2P architecture. + FAQ: https://ufonet.03c8.net/FAQ.html -------------------------------------------- -> UFONet-v1.8 [DPh] "DarK-PhAnT0m!" (.zip) -> md5 = [ c8ab016f6370c8391e2e6f9a7cbe990a ] -> UFONet-v1.8...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 19
    Dominate

    Dominate

    Dominate is a Python library for creating and manipulating HTML docs

    Dominate is a Python library for creating and manipulating HTML documents using an elegant DOM API. It allows you to write HTML pages in pure Python very concisely, which eliminates the need to learn another template language, and lets you take advantage of the more powerful features of Python. Dominate can also use keyword arguments to append attributes onto your tags. Most of the attributes are a direct copy from the HTML spec with a few variations. Through the use of the += operator and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Hero Planner

    Hero Planner

    An easy to use GUI for creating team rotas and scheduling absences.

    HERO Planner – Unleash Your Scheduling Superpowers Is rostering your kryptonite? Transform chaos into control with HERO Planner, the command center for retail, hospitality, and service teams. This modern desktop app turns managers into scheduling superheroes—no cape required. Your Utility Belt: ✓ Super Speed: Intuitive drag-and-drop & bulk actions for lightning-fast updates. ✓ Compliance Shield: Real-time alerts guard against labour law violations and overtime risks. ✓ X-Ray Vision:...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 21
    3tdb

    3tdb

    A web based MariaDB client.

    LINUX ONLY - requires mod_python Python MariaDB client. ** 12.2-0 ** 2026-01-10 mod_python was removed from debian trixie (13). As of this writing, mod_python can be compiled from git. 3tp3 uses minimal javascript, which you can examine in the js folder (one file, 36 lines). 3tp3 does not load any external (internet or other) code. No external fonts, helper code or advertisement. It can be installed and run on localhost.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 22
    IPyPlot

    IPyPlot

    Fast and efficient plotting of images inside Python Notebooks

    IPyPlot is a small python package offering fast and efficient plotting of images inside Python Notebooks. It's using IPython with HTML for faster, richer and more interactive way of displaying big numbers of images.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    myScite

    myScite

    The allRound pocket sized CodeEditor.

    .... -- Features -- - Full MinGW and GTK SDKs Autocomplete.(190+) - Do system scripting (bash, applescript, cmd, powershell, perl, j/vbscript, awk) - Examine all sorts of data files (sql, regedit, mib, xml, yaml, json, vcard ...) - Review difference and patch files - Create makefiles (gnu make / cmake) - Edit html, css and config files (with calltips) - Describe circuits in vhdl and spice. ... - And finally; read & write source code: - [ Syntax highlighted ] - go, vala, pike, swift, flash, ch, rust - [ Calltip assisted ] - c/cpp11, js&jQuery, python, php, ruby, lua, c#, java, perl --Others-- - Restructured config files with inline docs - Scriptable via lua Extension...
    Leader badge
    Downloads: 2 This Week
    Last Update:
    See Project
  • 24
    CAMPARI

    CAMPARI

    Software for molecular simulations and trajectory analysis

    We are proud to introduce version 5 of CAMPARI. We have added a number of new features, most notably a Python interface for interpreting user-supplied code (with the help of ForPy), a novel trajectory storage standard (with the help of libpqxx/PostgreSQL), and a module for performing transition path theory. Naturally, CAMPARI continues to provide the reference implementation of the ABSINTH force field paradigm and implicit solvation model. CAMPARI is a joint package for performing and...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 25
    CC2.TV / CC2 - Audio- und TV-Datenbank

    CC2.TV / CC2 - Audio- und TV-Datenbank

    Meta-Datenbank-Anwendung für die Audio- und TV-Sendungen des CC2.TV

    Dieses Programm stellt eine Meta-Datenbank-Anwendung für die Audio- und Video-Sendungen des CC2.TV für GNU/Linux Systeme zur Verfügung. Es ermöglicht das Durchsuchen, Verwalten und Abspielen der umfangreichen Inhalte des CC2.TV-Audiocasts und -Videocasts. Ziel ist es, die über 3000 Audiocast-Themen und über 1000 Videocast-Themen, die sich auf Computerthemen, Technik und gesellschaftliche Aspekte konzentrieren, komfortabel zugänglich zu machen. Für die volle Funktionalität,...
    Downloads: 1 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB