Showing 22 open source projects for "web extract"

  • 1
    jsoup

    Java library for working with real-world HTML

    ... attempt to create a clean parse from the HTML you provide, regardless of whether the HTML is well-formed or not. You have HTML in a Java String, and you want to parse that HTML to get at its contents, or to make sure it's well formed, or to modify it. The String may have come from user input, a file, or from the web.
    Downloads: 5 This Week
  • 2
    Tarsier

    Vision utilities for web interaction agents

    At Reworkd, we iterated on all these problems across tens of thousands of real web tasks to build a powerful perception system for web agents... Tarsier! In the video below, we use Tarsier to provide webpage perception for a minimalistic GPT-4 LangChain web agent. Tarsier visually tags interactable elements on a page via brackets + an ID e.g. [23]. In doing this, we provide a mapping between elements and IDs for an LLM to take actions upon (e.g. CLICK [23]). We define interactable elements...
    Downloads: 1 This Week
  • 3
    Figma Sprite Generator

    A Figma plugin to generate sprite sheets and JSON files from selected

    Figma Sprite Generator is a Figma plugin designed to create sprite sheets and corresponding JSON metadata from selected icons. This tool is ideal for designers and developers working on web projects or UI libraries, as it automates sprite generation for more efficient workflows. Users can quickly convert their icons into PNG sprites and extract JSON files with icon dimensions and positions, streamlining the process of incorporating sprites into websites or applications.
    Downloads: 0 This Week
  • 4
    Critical

    Extract & Inline Critical-path CSS in HTML pages

    ... rule inclusion). Generate critical-path CSS with multiple resolutions. This is a useful option when you e.g. want to defer the loading of web fonts or background images. Generate critical-path CSS and specify asset rebase behavior. Inline critical-path CSS using filamentgroup's loadCSS. Pass an object to configure inline-critical.
    Downloads: 0 This Week
  • 5
    PDF Split and Merge

    Split and merge PDF files on any platform

    Split and merge PDF files with PDFsam, an easy-to-use desktop tool with graphical, command line and web interface.
    Downloads: 345 This Week
  • 6
    HelloNzb

    The Binary Usenet Tool

    With HelloNzb you can download (binary) files from Usenet servers via NZB index files. The software is based on Java and can thus run on many platforms (tested on Windows and Linux). Automatic archive verification via PAR2, automatic RAR archive extraction, built-in yEnc- and UU-decoding. Portable, no installation required.
    Downloads: 14 This Week
  • 7
    FireTeX: LaTeX Editor and Compiler

    Edit your LaTeX and TeX files

    FireTeX is a complete web-based LaTeX editor: powerful, intuitive, and stocked with useful functions, including export of the results in three formats. It combines an editor with a LaTeX compiler, code highlighting, advanced search/replace, and the HTML5 filesystem API. An Android app is available on the Play Store (https://play.google.com/store/apps/details?id=com.ulmdesign.ulmtex). Update 30.06.2017: Windows 7 and later and macOS 10.9 and later are supported. Browser extensions: Add-on Opera...
    Downloads: 0 This Week
  • 8
    WebHarvest - web data extraction tool
    Web data extraction (web data mining, web scraping) tool. It leverages well-proven XML and text-processing technologies to easily extract useful data from arbitrary web pages.
    Downloads: 17 This Week
  • 9
    Cloud Export is a tool to automatically extract your data from web applications and save it to your local file system for backup purposes, but more extensive than Google Takeout. Plans are based on http://www.dataliberation.org.
    Downloads: 0 This Week
  • 10
    Text2MP3

    PDF/Text to MP3 - Text Processing to speech

    This project is deprecated. We apologize. Windows application that strips PDFs into text and converts it to speech. You can also save the extracted text to text files, Word docs, CSVs, and RTF format. Browse for PDFs from the web, save them, and strip them. Good for students, lecturers, theses, and educational purposes. Some bugs are yet to be fixed in the coming weeks, although these do not affect the functionality...
    Downloads: 0 This Week
  • 11

    htmlpicker

    Picks up text from a web page using an HTML template.

    A Java HTML picker and text extractor that picks up text from a web page using an HTML template. Useful if you regularly have data to extract from the same site. You may use the same URL, or you may build URLs with parameters. These parameters are fetched from a text file.
    Downloads: 0 This Week
  • 12
    Info Transformer is a powerful utility for maintaining information regardless of data format and storage. It offers many high-speed functions for common and specialized data processing: extract, transfer, load, convert, update, match, remove, and so on.
    Downloads: 0 This Week
  • 13
    This is an advanced web scraper with a user-friendly GUI that lets the user define rules and web addresses to extract data from, either once or periodically, and a target database field in which the data should be saved.
    Downloads: 0 This Week
  • 14
    This script lets you highlight, extract, and share content from a web page simply by selecting it with the mouse.
    Downloads: 0 This Week
  • 15
    MixDEM
    MixDEM is a web-based ETL tool meant for web integration, data transformation, and mashup editing. It includes the MixDEM ETL Engine, built with the Zend Framework, and the MixDEM GUI Editor, an AJAX IDE that enables developers to quickly and easily create applications.
    Downloads: 0 This Week
  • 16
    RecupNews is a utility to easily retrieve information from RSS feeds. Beyond simple retrieval, it extracts the data contained in the title and description of each item, as well as in the web page linked from the article.
    Downloads: 0 This Week
  • 17
    viewstate is a decoder and encoder for ASP.NET viewstate data. It supports the different viewstate data formats and can extract viewstate data directly from web pages. viewstate will also show any hash applied to the viewstate data.
    Downloads: 0 This Week
  • 18
    phpAjaxTags is a PHP port of the advanced Java tag library "AjaxTags" (ajaxtags.sourceforge.net). To install, just extract the zip file into your root www dir. Online demo: http://www.2al.pl/phpAjaxTags/
    Downloads: 0 This Week
  • 19
    Syncopate is an extension module to the Apache JMeter testing tool. It enhances JMeter's HTTP proxy server by adding functionality to extract variables and create assertions during HTTP request recording.
    Downloads: 0 This Week
  • 20
    Web Archive Extractor is intended to extract webarchive files into a folder.
    Downloads: 14 This Week
  • 21

    Xidel

    Xidel is a CLI webpage scraping tool supporting XPath/XQuery 3 and CSS

    Xidel is a command line tool to download web pages and extract data from them. This data can be extracted using XPath/XQuery 3.0 (with compatibility modes for XPath 2.0 and XQuery 1.0), JSONiq, CSS 3 selectors, and custom pattern-matching templates that are like an annotated version of the processed page. It can download files over HTTP/S connections, follow redirections, links, or extracted values, and also process local files. The extracted values can then be exported as plain...
    Downloads: 0 This Week
  • 22
    Jdeluxe is an Eclipse plugin that helps developers analyse B2B interfaces based on XML standards. It is designed to be highly adaptable and uses generic XQuery calls to extract documentation data. Data can be stored in a web repository via a web service.
    Downloads: 0 This Week