Search Results for "html source extractor" - Page 6

Showing 518 open source projects for "html source extractor"

View related business solutions
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    Access competitive interest rates on your digital assets.

    Generate interest, borrow against your crypto, and trade a range of cryptocurrencies — all in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 1
    MahaKurawa.My.ID URL Extractor

    MahaKurawa.My.ID URL Extractor

    MahaKurawa.My.ID URL Extractor is Simple Tool to extract unique URL

    MahaKurawa.My.ID URL Extractor is Simple Tool to extract unique URL from any text content in instant. It's useful when you lazy enough to identify and copy-paste URL from your content one by one by yourself.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Graphtage

    Graphtage

    A semantic diff utility and library for tree-like files such as JSON

    Graphtage is a command-line utility and underlying library for semantically comparing and merging tree-like structures, such as JSON, XML, HTML, YAML, plist, and CSS files. Its name is a portmanteau of “graph” and “graftage”, the latter being the horticultural practice of joining two trees together such that they grow as one. Graphtage performs an analysis on an intermediate representation of the trees that is divorced from the filetypes of the input files. This means, for example, that you...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    ddgr

    ddgr

    DuckDuckGo from the terminal

    ddgr is a command-line search tool that lets users perform DuckDuckGo web searches directly from their terminal, providing a privacy-focused alternative to browser-based searches without tracking or personalized profiling. It fetches search results via DuckDuckGo’s API or HTML output and presents links, snippets, and metadata in a clean terminal format, making it useful for programmers, sysadmins, and privacy advocates who prefer keyboard-driven workflows. The tool also supports options like...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 4
    sqlitebiter

    sqlitebiter

    A CLI tool to convert CSV / Excel / HTML / JSON / Jupyter Notebook

    A CLI tool to convert CSV / Excel / HTML / JSON / Jupyter Notebook / LDJSON / LTSV / Markdown / SQLite / SSV / TSV / Google-Sheets to an SQLite database file.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 5
    PGRColorPicker

    PGRColorPicker

    Easy and smart classic color picker

    Color picker app with support of multi type format: HTML, RGB, HSV, HSL, and CMYK format. Alpha channel support with preview and bidirectional copy/paste and drag&drop mode. Also has auto-copy feature for intesity and percent color level with/without type declaration. Multi custom profile management.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    PipeRider

    PipeRider

    Code review for data in dbt

    PipeRider automatically compares your data to highlight the difference in impacted downstream dbt models so you can merge your Pull Requests with confidence. PipeRider can profile your dbt models and obtain information such as basic data composition, quantiles, histograms, text length, top categories, and more. PipeRider can integrate with dbt metrics and present the time-series data of metrics in the report. PipeRider generates a static HTML report each time it runs, which can be viewed...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7

    Knobjex Info Manager

    Tool for PIM, mind-mapping, quality-management, knowledge-base.

    Also available for knowledge-enthusiasts: https://github.com/some-avail/freekwensie Knobjex 5.01 released partially; windows setup forthcoming; go to tab "Files" to download . Added shortcut keys (chapter 5.3) , dark themes better supported, removed bugs. No database-changes. Knobjex (short for Knowledge Objects) is an information-manager. It has many potential uses, such as calendar, task-list and sticky notes. Knobjex can also handle more advanced use-cases such as...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 8
    DeepSearch5Plus

    DeepSearch5Plus

    Search recursively all files, text inside files, and bookmarks

    Files, text, and bookmarks html/json search tool. It is able to merge duplicated bookmark links from multiple html/json files and export to html. The user interface is designed to be used by kyeboard in any aspect of its features scrolling list, select input field, menu bar, check and drop-down menu and so on. It's able to printout search list to pdf or system printer or save as text file. It allows to rename single file or change upper/lower case, join spaces in bulk mode with multiple...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Demucs

    Demucs

    Code for the paper Hybrid Spectrogram and Waveform Source Separation

    Demucs (Deep Extractor for Music Sources) is a deep-learning framework for music source separation—extracting individual instrument or vocal tracks from a mixed audio file. The system is based on a U-Net-like convolutional architecture combined with recurrent and transformer elements to capture both short-term and long-term temporal structure. It processes raw waveforms directly rather than spectrograms, allowing for higher-quality reconstruction and fewer artifacts in separated tracks. ...
    Downloads: 120 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 10

    Bookmark to url

    Save browser bookmarks to single shortcut file

    Python program that transfers all browser bookmarks to single windows .url shortcuts, organized in their respective folders, and with corresponding icons. Simply export your bookmarks in html file and open it. Bugs: some link's name, with special characters, can cause problems with icons and links in main root are copied in last folder.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    nodejsscan

    nodejsscan

    nodejsscan is a static security code scanner for Node.js applications

    Static security code scanner (SAST) for Node.js applications powered by libsast and semgrep. nodejsscan is a static security code scanner for Node.js applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    PanConvert

    PanConvert

    Gui for the Document converter pandoc

    A cross platform gui application for the document converter pandoc. Some converters like html, opml, markdown and latex can be conveniently used by just clicking and then converting the loaded or typed text. Some more advanced functions of pandoc have to be typed in as parameters.
    Downloads: 34 This Week
    Last Update:
    See Project
  • 13
    dirhunt

    dirhunt

    Web crawler that finds hidden web directories without brute force

    Dirhunt is an open source security tool designed to discover web directories and analyze website structures without relying on brute-force techniques. Instead of sending large numbers of guess-based requests, it operates as a specialized crawler that intelligently explores websites to identify accessible or hidden directories. Dirhunt can detect directories that expose “Index Of” listings, which may reveal files and other resources that were not intended to be publicly visible. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 14
    Amazon Braket Strawberry Fields Plugin

    Amazon Braket Strawberry Fields Plugin

    An open source framework for using Amazon Braket devices

    An open-source framework for using Amazon Braket devices with the Strawberry Fields photonic device programming library. This plugin provides a BraketEngine class for running photonic quantum circuits created in Strawberry Fields on the Amazon Braket service. The Amazon Braket Python SDK is an open source library that provides a framework to interact with quantum computing hardware devices and simulators through Amazon Braket. This plugin provides the classes BraketEngine for submitting...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    footswitch3

    footswitch3

    Audio Transcription software for Linux (Gstreamer) with a foot pedal

    Footswitch 3 is a media player for transcribers on Linux. Written in python using the python bindings for Gstreamer it allows a transcriber to control the audio or video with a foot pedal, and includes a set of macros that integrate into LibreOffice. This allows the transcriber to control the media player from within Libreoffice as well, making it useful for those who do not yet own a foot pedal/foot switch. Control of the media player from LibreOffice can be via Hotkeys or an integrated...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 16

    Kanbanara

    Web-based Project Management System using the Kanban methodology

    Kanbanara, a web-based Project Management System, uses the Kanban methodology. Started in 2013, the product is already quite well advanced. It is written in Python 3.7+ and utilizes MongoDB and CherryPy. Its Kanban board features projects, user-definable workflow with custom states, support for epic, feature, story, enhancement, defect, task, test, bug and transient cards, global and personal WIP limits, role-based columns (Owner, Reviewer or Quality Assurance), support for ghost cards...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    DeepCTR-Torch

    DeepCTR-Torch

    Easy-to-use,Modular and Extendible package of deep-learning models

    DeepCTR-Torch is an easy-to-use, Modular and Extendible package of deep-learning-based CTR models along with lots of core components layers that can be used to build your own custom model easily.It is compatible with PyTorch.You can use any complex model with model.fit() and model.predict(). With the great success of deep learning, DNN-based techniques have been widely used in CTR estimation tasks. The data in the CTR estimation task usually includes high sparse,high cardinality categorical...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    OpsManage

    OpsManage

    Automated operation and maintenance platform

    Automated operation and maintenance platform: code and application deployment CI/CD, asset management CMDB, scheduled task management platform, SQL audit | rollback, task scheduling, on-site WIKI. A code deployment, application deployment, scheduled tasks, and equipment asset management platform. Welcome to star or fork my open source project. If you need to quote the project code in your own project, please declare the agreement and copyright information in the project. Programming language, Python3.6 + HTML + JScripts. Front-end web framework, Bootstrap. Backend Web Framework, Django. Backend Task Framework, Celery + Redis. Programming language is Python 3.6. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Pattern

    Pattern

    Web mining module for Python, with tools for scraping

    Pattern is an open-source Python library that provides tools for web mining, natural language processing, machine learning, and network analysis. The project integrates multiple capabilities into a single framework that allows developers to collect, process, and analyze textual data from the web. It includes modules for web scraping and crawling that can retrieve information from sources such as social media platforms, search engines, and online knowledge bases. In addition to data mining...
    Downloads: 11 This Week
    Last Update:
    See Project
  • 20
    ediViewer

    ediViewer

    View, Edit and extract transactions from (PESC) standard Edi Files.

    View, Edit and extract edi transactions from PESC standard approved Edi files. The EdiViewer has been tested to work with the following [PESC standards] (https://www.pesc.org/pesc-approved-standards-1.html) 1. TS 189 Application for Admission to Educational Institutions 2. TS 130 Educational Record (Transcript)
    Downloads: 2 This Week
    Last Update:
    See Project
  • 21
    AutoScraper

    AutoScraper

    A Smart, Automatic, Fast and Lightweight Web Scraper for Python

    This project is made for automatic web scraping to make scraping easy. It gets a URL or the HTML content of a web page and a list of sample data that we want to scrape from that page. This data can be text, URL or any HTML tag value of that page. It learns the scraping rules and returns similar elements. Then you can use this learned object with new URLs to get similar content or the exact same element of those new pages.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    CCTV Frame Timestamp Extractor

    CCTV Footage Timestamp Search Tool

    Python script to address the problem of manually locating required event timestamps from carved CCTV DVR footages. Full details can be found in the paper published in Eighteenth Annual IFIP WG 11.9 International Conference on Digital Forensics. Link to paper: https://link.springer.com/chapter/10.1007/978-3-031-10078-9_8 The project has been divided into four modules: Framextract.py- Extracts frames from video footages Reconstruct.py- Attempts to repair unplayable video by extracting...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 23
    Amazon Braket Ocean Plugin

    Amazon Braket Ocean Plugin

    A Python plugin for using Ocean with Amazon Braket

    The Amazon Braket Ocean Plugin is an open-source library in Python that provides a framework that you can use to interact with Ocean tools on top of Amazon Braket. Before you begin working with the Amazon Braket Ocean Plugin, make sure that you've installed or configured the following prerequisites. Download and install Python 3.7.2 or greater from Python.org. If you are using Windows, choose Add Python to environment variables before you begin the installation. Make sure that your AWS...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    TreeLine stores almost any kind of information in an organized tree structure. Each node in the tree can contain several fields, forming a mini-database.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 25
    Recursive Sans & Mono

    Recursive Sans & Mono

    Recursive Mono & Sans is a variable font family for code & UI

    Recursive Sans & Mono is a variable type family built for better code & UI. It is inspired by casual script signpainting, but designed primarily to meet the needs of programming environments and application interfaces. In programming, “recursion” is when a function calls itself, using its own output as an input to yield powerful results. Recursive Mono was used as a tool to help build itself: it was used to write Python scripts to automate type production work and to generate specimen...
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB