Emerald Text Editor is a tabbed text editor with heavy customizability
Emerald Text Editor (Emerald Editor, or Emerald as I call it), formerly called jEditor, is a text editor that is much like Notepad in that it lets you edit text, but it uses tabbed panes, so you can have multiple tabs open and edit several files at once.
Emerald Text Editor also comes with a toolbar which tells you how quickly you are typing and how many characters are in your current document. The program is also...
OmniHelp is a cross-platform, browser-independent, tri-pane help viewer built in pure JavaScript and CSS with HTML 4. Some functions (such as help embedding) may in the future be in Java, C, or C++; CSH is fully supported. All code is under the LGPL.
Tools to update all pages of a web site with a single command. The header and menu can be copied onto each page. It is also possible to add a header (a licence, for example) to all source files of a project.
Applies a simple SQL query (without joins) to a tab-separated-value text file. Intended for the commandline, like "txtselect 'c2 from states.txt where c3 > 1000000'." Distributed as a single Python source-code file.
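The idea of querying a tab-separated file by column position can be sketched in a few lines of Python. This is only an illustration of the technique, not txtselect's actual implementation: the function name `query_tsv` and its parameters are hypothetical, and the sketch assumes the data is loaded into an in-memory SQLite table whose columns are named c1, c2, c3, and so on.

```python
import csv
import io
import sqlite3


def query_tsv(tsv_text, select_clause, where_clause=""):
    """Run a join-free SELECT over tab-separated text.

    Columns are exposed as c1, c2, ... (hypothetical naming, mirroring
    the c2/c3 style used in the description above).
    """
    rows = list(csv.reader(io.StringIO(tsv_text), delimiter="\t"))
    width = max(len(r) for r in rows)
    # NUMERIC affinity lets SQLite store numeric-looking text as numbers,
    # so comparisons such as "c3 > 1000000" behave numerically.
    cols = ", ".join(f"c{i + 1} NUMERIC" for i in range(width))
    conn = sqlite3.connect(":memory:")
    conn.execute(f"CREATE TABLE t ({cols})")
    placeholders = ", ".join("?" * width)
    # Pad short rows with NULLs so every row has the same number of fields.
    padded = [r + [None] * (width - len(r)) for r in rows]
    conn.executemany(f"INSERT INTO t VALUES ({placeholders})", padded)
    # The clauses are interpolated as-is: acceptable for a local
    # command-line sketch, not for untrusted input.
    sql = f"SELECT {select_clause} FROM t"
    if where_clause:
        sql += f" WHERE {where_clause}"
    result = conn.execute(sql).fetchall()
    conn.close()
    return result
```

Called as `query_tsv(text, "c2", "c3 > 1000000")`, this returns the second field of every row whose third field exceeds one million, as a list of one-element tuples.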
regexxer is a nifty GUI search/replace tool featuring Perl-style regular expressions. If you need project-wide substitution and you're tired of hacking sed command lines together, then you should definitely give regexxer a try.
otl is a text processor for generating markup from plain text. Much of both the input and output formats can be customized. otl supports structures such as nested ordered lists, headers and footers, and tables.
Find And Replace Text commandline utility. New & improved version of the well-known grep command, with advanced features such as: case-adaption of the replace string; find (& replace) in filenames, auto CVS edit.
Moved to https://github.com/lionello/fart-it
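"Case-adaption of the replace string" can be read as: the replacement takes on the capitalization of each match it replaces. The Python sketch below shows one plausible interpretation. The exact rules used by the utility are not stated here, so the three-way rule (all upper, all lower, leading capital) and the function names are assumptions.

```python
import re


def adapt_case(found, replacement):
    """Give `replacement` the same capitalization style as `found`."""
    if found.isupper():
        return replacement.upper()
    if found.islower():
        return replacement.lower()
    if found[:1].isupper():
        # Leading capital only: capitalize the first letter of the replacement.
        return replacement[:1].upper() + replacement[1:]
    return replacement


def replace_adapting_case(text, find, replacement):
    """Replace `find` case-insensitively, adapting case per match."""
    pattern = re.compile(re.escape(find), re.IGNORECASE)
    return pattern.sub(lambda m: adapt_case(m.group(0), replacement), text)
```

With this sketch, replacing "cat" with "dog" in "Cat cat CAT" yields "Dog dog DOG".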
Filecmp is a command-line application that takes two filenames as arguments and outputs the result of comparing them, e.g. whether they are the same or not. It may seem irrelevant, but sometimes it's very useful, especially inside scripts.
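For comparison, the same kind of check is easy to script in Python with the standard library's `filecmp` module (unrelated to this project despite the shared name). The helper names below, and the convention of returning 0 for identical files and 1 otherwise, are assumptions made for this sketch rather than the documented behaviour of the tool.

```python
import filecmp


def same_content(path_a, path_b):
    """Return True if both files have identical contents."""
    # shallow=False forces a byte-by-byte comparison instead of
    # trusting file size and modification time alone.
    return filecmp.cmp(path_a, path_b, shallow=False)


def compare_exit_code(path_a, path_b):
    """Map the comparison to a script-friendly status: 0 same, 1 different."""
    return 0 if same_content(path_a, path_b) else 1
```

A wrapper script could pass the result of `compare_exit_code` to `sys.exit` so that shell scripts can branch on the outcome.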
transpose is a fast and efficient command-line tool that transposes (the rows become the columns, the columns become the rows) or reshapes a matrix in plain text. Inputs and outputs can be files or streams.
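The basic operation, swapping rows and columns of a plain-text matrix, can be sketched in Python as follows. This is an illustration only and not the tool's implementation. The function names and the tab separator are assumptions, and the sketch expects rectangular input.

```python
def transpose(rows):
    """Swap rows and columns of a rectangular list of rows."""
    # zip(*rows) groups the i-th field of every row together.
    # Note: zip stops at the shortest row, so ragged input is truncated.
    return [list(col) for col in zip(*rows)]


def transpose_text(text, sep="\t"):
    """Transpose a matrix given as separator-delimited lines of text."""
    rows = [line.split(sep) for line in text.splitlines() if line]
    return "\n".join(sep.join(row) for row in transpose(rows))
```

For example, the two-line input "1 2 3" / "4 5 6" (tab-separated) becomes the three lines "1 4", "2 5", "3 6".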
Rephrase is a simple string replacement application. The default package comes with a wordiness rule file, a 1337 rule file, and an English to French rule file. It can be used on the commandline as part of a set of other tools as well.
pdfreflow is a commandline utility that operates on the output of the poppler utility pdftohtml. pdfreflow reflows the text into paragraphs while removing hyphenation, page numbers, headers, and footers.
Apostilator is an XML-based front end to LaTeX that simplifies the process of writing all kinds of documents, from letters to books, reports, and articles. It's easy to customize and to create templates defining a style for all your documents.
GOSH takes a Usenet-style ASCII text as input and generates LaTeX code, HTML, or man pages. Its transformation is easily customizable, which makes GOSH suited for creating web content, papers, letters, and technical documentation.
Build texts and documents bottom-up from building blocks using a language similar to (and compatible with) that of GNU Make, in such a way that variants (e.g. translations) can be generated by overwriting parts of the text with minimal redundancy.
A stand-alone editor that uses the MediaWiki markup language to generate HTML code. You can create and preview pages written in MediaWiki markup (e.g. Wikipedia pages) while offline.
HTMLtools includes several Java HTML tools for preparing Web pages. The HTMLtools program automates batch conversion of tab-delimited spreadsheet text files to HTML Web-page files, file & table editing, keyword mapping, templates, and more.