Showing 232 open source projects for "web extract"

View related business solutions
  • Red Hat Enterprise Linux on Microsoft Azure Icon
    Red Hat Enterprise Linux on Microsoft Azure

    Deploy Red Hat Enterprise Linux on Microsoft Azure for a secure, reliable, and scalable cloud environment, fully integrated with Microsoft services.

    Red Hat Enterprise Linux (RHEL) on Microsoft Azure provides a secure, reliable, and flexible foundation for your cloud infrastructure. Red Hat Enterprise Linux on Microsoft Azure is ideal for enterprises seeking to enhance their cloud environment with seamless integration, consistent performance, and comprehensive support.
  • Total Network Visibility for Network Engineers and IT Managers Icon
    Total Network Visibility for Network Engineers and IT Managers

    Network monitoring and troubleshooting is hard. TotalView makes it easy.

    This means every device on your network, and every interface on every device is automatically analyzed for performance, errors, QoS, and configuration.
  • 1
    Web Spider, Web Crawler, Email Extractor

    Web Spider, Web Crawler, Email Extractor

    Free Extracts Emails, Phones and custom text from Web using JAVA Regex

    In Files there is WebCrawlerMySQL.jar which supports MySql Connection Free Web Spider & Crawler. Extracts Information from Web by parsing millions of pages. Store data into Derby Database and data are not being lost after force closing the spider. - Free Web Spider , Parser, Extractor, Crawler - Extraction of Emails , Phones and Custom Text from Web - Export to Excel File - Data Saved into Derby and MySQL Database - Written in Java Cross Platform Also See Free email Sender...
    Leader badge
    Downloads: 103 This Week
    Last Update:
    See Project
  • 2
    Homebrew App Store

    Homebrew App Store

    GUI for downloading/managing homebrew apps for video game consoles

    A Chesto-based graphical frontend to the get package manager for downloading and managing homebrew on video game consoles, such as the Nintendo Switch and Wii U. This is a replacement to the older Wii U Homebrew App Store. To run this program, a Nintendo Switch with access to the Homebrew Menu is required. This can be done on most Switches manufactured before July 2018, for compatibility check your serial number at ismyswitchpatched. Download and extract the latest hb-appstore to sd:/switch...
    Downloads: 42 This Week
    Last Update:
    See Project
  • 3
    Free SAP Table Download Power Connector

    Free SAP Table Download Power Connector

    Free Download / Extract / Read Table from SAP to Excel / CSV / XML

    Download / Read any SAP table to Excel and/or CSV or XML individually or in groups! Use (simple/complex) filters . Option to schedule downloads or start via command line as well. Can replace full table download from SE16 SE16N SE16H and ties as connector with RPA e.g. Blueprism, UIPATH, Alteryx, Power Apps & Power Automate & Excel. Can also provide you SAP table downloads via its web services for e.g. Power Query and Power BI.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 4
    Scrapy

    Scrapy

    A fast, high-level web crawling and web scraping framework

    Scrapy is a fast, open source, high-level framework for crawling websites and extracting structured data from these websites. Portable and written in Python, it can run on Windows, Linux, macOS and BSD. Scrapy is powerful, fast and simple, and also easily extensible. Simply write the rules to extract the data, and add new functionality if you wish without having to touch the core. Scrapy does the rest, and can be used in a number of applications. It can be used for data mining, monitoring...
    Downloads: 23 This Week
    Last Update:
    See Project
  • Business Continuity Solutions | ConnectWise BCDR Icon
    Business Continuity Solutions | ConnectWise BCDR

    Build a foundation for data security and disaster recovery to fit your clients’ needs no matter the budget.

    Whether natural disaster, cyberattack, or plain-old human error, data can disappear in the blink of an eye. ConnectWise BCDR (formerly Recover) delivers reliable and secure backup and disaster recovery backed by powerful automation and a 24/7 NOC to get your clients back to work in minutes, not days.
  • 5
    monolith

    monolith

    CLI tool for saving complete web pages as a single HTML file

    A data hoarder’s dream come true, bundle any web page into a single HTML file. You can finally replace that gazillion of open tabs with a gazillion of .html files stored somewhere on your precious little drive. Unlike the conventional “Save page as”, monolith not only saves the target document, it embeds CSS, image, and JavaScript assets all at once, producing a single HTML5 document that is a joy to store and share. If compared to saving websites with wget -mpk, this tool embeds all assets...
    Downloads: 18 This Week
    Last Update:
    See Project
  • 6
    Nativefier

    Nativefier

    Make any web page a desktop application

    Nativefier is a command-line tool designed to create a desktop app for any web site with minimal configuration. Apps are wrapped by Electron (using Chromium under the hood) in an OS executable (.app, .exe, etc) for use on Windows, macOS and Linux. Nativefier will try to determine the app name, and well as lots of other options. If desired, these options can be overwritten. For example, to override the name, nativefier --name 'My Medium App' 'medium.com' Read the API documentation or run...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 7
    Goutte

    Goutte

    Goutte, a simple PHP Web Scraper

    Goutte is a screen scraping and web crawling library for PHP. Goutte provides a nice API to crawl websites and extract data from the HTML/XML responses. Goutte depends on PHP 7.1+. Add fabpot/goutte as a require dependency in your composer.json file. Create a Goutte Client instance (which extends Symfony\Component\BrowserKit\HttpBrowser). Make requests with the request() method. The method returns a Crawler object (Symfony\Component\DomCrawler\Crawler). To use your own HTTP settings, you may...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 8
    Openbooks

    Openbooks

    Search and Download eBooks

    Openbooks allows you to download ebooks from irc.irchighway quickly and easily. For a complete list of features use the --help flags on all subcommands. For example openbooks cli --help or openbooks cli download --help. There are two modes; Server or CLI. In CLI mode you interact and download books through a terminal interface. In server mode the application runs as a web application that you can visit in your browser. OpenBooks server doesn't have to be hosted at the root of your webserver...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 9
    jsoup

    jsoup

    Java library for working with real-world HTML

    ... attempt to create a clean parse from the HTML you provide, regardless of whether the HTML is well-formed or not. You have HTML in a Java String, and you want to parse that HTML to get at its contents, or to make sure it's well formed, or to modify it. The String may have come from user input, a file, or from the web.
    Downloads: 2 This Week
    Last Update:
    See Project
  • Recruit and Manage your Workforce Icon
    Recruit and Manage your Workforce

    Evolia makes it easier to hire, schedule and track time worked by frontline in medium and large-sized businesses.

    Evolia is a web and mobile platform that connects enterprises with 1000’s of local shift workers and offers free workforce scheduling and time and attendance solutions. Is your business on Evolia?
  • 10
    ScrapeGraphAI

    ScrapeGraphAI

    Python scraper based on AI

    Extracting content from websites and local documents using LLM. ScrapeGraphAI is a web scraping python library that uses LLM and direct graph logic to create scraping pipelines for websites and local documents (XML, HTML, JSON, Markdown, etc.). Just say which information you want to extract and the library will do it for you.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    crawley

    crawley

    The unix-way web crawler

    Crawls web pages and prints any link it can find. Fast HTML SAX-parser (powered by golang.org/x/net/html) Small (below 1500 SLOC), idiomatic, 100% test-covered codebase. Grabs most of useful resources URLs (pics, videos, audios, forms, etc...) Found URLs are streamed to stdout and guaranteed to be unique (with fragments omitted) Scan depth (limited by starting host and path, by default - 0) can be configured. Can crawl rules and sitemaps from robots.txt. Brute mode - scan HTML comments for URLs...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Tarsier

    Tarsier

    Vision utilities for web interaction agents

    At Reworkd, we iterated on all these problems across tens of thousands of real web tasks to build a powerful perception system for web agents... Tarsier! In the video below, we use Tarsier to provide webpage perception for a minimalistic GPT-4 LangChain web agent. Tarsier visually tags interactable elements on a page via brackets + an ID e.g. [23]. In doing this, we provide a mapping between elements and IDs for an LLM to take actions upon (e.g. CLICK [23]). We define interactable elements...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    fastdup

    fastdup

    An unsupervised and free tool for image and video dataset analysis

    fastdup is a powerful free tool designed to rapidly extract valuable insights from your image & video datasets. Assisting you to increase your dataset images & labels quality and reduce your data operations costs at an unparalleled scale.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Rust Data Analysis

    Rust Data Analysis

    Rust for data analysis encyclopedia (WIP)

    Welcome to the Rust Data Analysis repository! This collection of Jupyter notebooks provides a comprehensive exploration of data analysis using Rust. Powered by a Rust kernel, these notebooks allow you to dive deep into the realm of data analysis, leveraging the capabilities of the Rust programming language. With the help of various Rust libraries, such as ndarray, plotters, and more, you'll be able to extract valuable insights from different datasets with ease.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Cloud Commander

    Cloud Commander

    Cloud Commander file manager for the web with console and editor

    Cloud Commander is a file manager for the web. It includes a command-line console and a text editor. Cloud Commander helps you manage your server and work with files, directories and programs in a web browser from any computer, mobile or tablet. Can be used local or remotely. Adapts to screen size. 3 built-in editors with support of syntax highlighting: Dword, Edward and Deepword. Console with support of the default OS command line. Written in JavaScript/Node.js. Built-in archives pack: zip...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Critical

    Critical

    Extract & Inline Critical-path CSS in HTML pages

    ... rule inclusion). Generate critical-path CSS with multiple resolutions. This is a useful option when you e.g. want to defer the loading of web fonts or background images. Generate critical-path CSS and specify asset rebase behavior. Inline critical-path CSS using filamentgroup's loadCSS. Pass an object to configure inline-critical.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Web Spider, Web Crawler, Email Extractor

    Web Spider, Web Crawler, Email Extractor

    Free Extracts Emails, Phones and custom text from Web using JAVA Regex

    In Files there is WebCrawlerMySQL.jar which supports MySql Connection Please follow this link to get latest version https://sourceforge.net/projects/web-spider-web-crawler-extract/ Free Web Spider & Crawler. Extracts Information from Web by parsing millions of pages. Store data into Derby OR MySQL Database and data are not being lost after force closing the spider. - Free Web Spider , Parser, Extractor, Crawler - Extraction of Emails , Phones and Custom Text from Web - Export...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 18

    Rapid Reference

    An extension that allows for hassle-free website citation/referencing.

    Please do not distribute with the goal of selling my program. How to attach to your Chrome/Edge/Brave etc Browser: 1. Download the extension(rapidreference.zip) 2. Extract 3. Go to chrome://extensions if on Chrome, or navigate to your extension management setting in your browser 4. Enable developer mode (usually top right) 5. Add unpacked extension 6. Choose the extracted extension's folder 7. There you go! How to use: 1. Start a session in the panel of the extension...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Beautiful Web Fonts in Various Languages

    Beautiful Web Fonts in Various Languages

    Including Demo Example Webpage & Professional Documenting Application

    ..., TraditionalChineseHW, Kannada, Khmer, Main Latin, Cyrillic, Greek, Malayalam, Myanmar, Oriya, Punjabi, Punjabi-Gurmukhi, Sinhala, Tamil, Telugu, Thai, Tibetan, Urdu, Cyrillic (ext), Greek (ext), Latin (ext), Vietnamese Please extract this 2.5 GB ZIP file and follow the instructions in the "1. README_FIRST.txt" file inside. These are unicode fonts, which are compatible across many devices. Wish you all success in your existing / new job opportunities, that you get with this project.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Apollo CLI

    Apollo CLI

    Tooling for development and production Apollo workflows

    ... subgraphs independently. Apollo's industry-standard, open-source GraphQL clients for web, Kotlin, and iOS provide everything your apps need to fetch, cache, and interact with data from your graph. We believe in giving developers access to powerful open source tools. You will never be asked to pay for Apollo Client. Apollo Client is a way to consume a graph. Apollo, as a company, gives you multiple libraries to this that are suited to wherever you’re building your app: web, ios, and android.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Rista Web Browser

    Rista Web Browser

    Rista Web Browser 1.0.0.0

    First extract rwbrowser,zip in a folder in c drive name rwbrowser where four text files will be extraccted 1) History.txt, 2) HomePage.txt, 3) BookMarks.txt 4) SearchPage.txt. Rista Web Browser is Microsoft WebView2 CoreWebView2 (Microsoft edge runtime) component based web browser with many menus and simple functions. It is developed in visual basic dot net with msedge runtime webview2 corewebview2. It requires Microsoft Edge Runtime WebView2CoreWebView2 installed in windows 10 or 11 os. To any...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 22
    PDF Split and Merge

    PDF Split and Merge

    Split and merge PDF files on any platform

    Split and merge PDF files with PDFsam, an easy-to-use desktop tool with graphical, command line and web interface.
    Leader badge
    Downloads: 406 This Week
    Last Update:
    See Project
  • 23
    Swiss File Knife

    Swiss File Knife

    One hundred command line tools in a small and portable binary.

    Create zip files, extract zip files, replace text in files, search in files using expressions, stream text editor, instant command line ftp and http server, send folder via network, copy folder excluding sub folders and files, find duplicate files, run a command on all files of a folder, split and join large files, make md5 checksum lists of files, remove tab characters, convert CR/LF, list newest or biggest files of a folder, compare folders, treesize, show first or last lines of a file, find...
    Leader badge
    Downloads: 412 This Week
    Last Update:
    See Project
  • 24
    Stirling-PDF

    Stirling-PDF

    #1 Locally hosted web application that allows you to work on PDFs

    This is a robust, locally hosted web-based PDF manipulation tool using Docker. It enables you to carry out various operations on PDF files, including splitting, merging, converting, reorganizing, adding images, rotating, compressing, and more. This locally hosted web application has evolved to encompass a comprehensive set of features, addressing all your PDF requirements. Stirling PDF does not initiate any outbound calls for record-keeping or tracking purposes. All files and PDFs exist...
    Leader badge
    Downloads: 117 This Week
    Last Update:
    See Project
  • 25
    WFDownloader App

    WFDownloader App

    Free batch downloader for image, wallpaper, video, audio, document,

    Use as an image gallery, wallpaper, audio/music, video, document, and other media bulk downloader from supported websites. Also use to download sequential website urls that have a certain pattern (e.g. image01.png to image100.png). Also use app's built-in site crawler for advanced link search or extraction. There is also special support for forum media and open directory downloading. It's a programmable downloader and also works with password protected sites. Say goodbye to downloading one...
    Leader badge
    Downloads: 114 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next