Bibgrep indexes and searches BibTeX files for entries matching
a given query. Its usage is similar to the command "grep", and
its queries use a Google-like syntax.
A UNIX command-line utility written in Perl that takes a raw Project Gutenberg text file and converts it to highly formatted HTML.
Designed to make reading Gutenberg eBooks much easier and more enjoyable.
Paragraph justification in any of the following formats: left-justified, right-justified, centered, or both left- and right-justified, with an option for word hyphenation when a word does not fit on a line.
Doc2Html is a command-line program that strips the HTML files produced by Word (by opening a document and saving it as HTML), leaving plain text plus minimal HTML code. It also has a mode to convert data between different character sets: DOS, Windows-1250, and ISO-8859.
Replacehex is a command-line search-and-replace utility that works with strings of hex codes. It operates efficiently on both binary and text files.
A great tool for working on binary data files. It was first developed to replace MAC addresses inside a pcap capture.
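The idea of searching and replacing by hex codes can be illustrated with a minimal Python sketch. It is not Replacehex's actual implementation or command-line interface; the function name `replace_hex_bytes` and its parameters are hypothetical, chosen only for illustration.

```python
def replace_hex_bytes(data: bytes, find_hex: str, replace_hex: str) -> bytes:
    """Replace every occurrence of one byte sequence with another.

    Both sequences are given as hex strings such as "001122334455".
    This is an illustrative sketch, not the Replacehex tool itself.
    """
    find = bytes.fromhex(find_hex)        # hex text -> raw bytes
    replacement = bytes.fromhex(replace_hex)
    if not find:
        raise ValueError("the search pattern must not be empty")
    return data.replace(find, replacement)


# Example: swap one MAC address for another inside a binary blob.
blob = b"\x00\x11\x22\x33\x44\x55\xff"
patched = replace_hex_bytes(blob, "001122334455", "aabbccddeeff")
```

When patching a capture file, keeping the replacement the same length as the search pattern avoids shifting any offsets stored elsewhere in the file.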
Command-line XML parser using the "expat" library that lets you extract, add, modify, delete, split, format, unformat, or count tag values, names, and attributes on the fly. Useful for shell scripting on UNIX or Linux based systems.
One-Liner creates one-line paragraphs from CR/LF- or LF-delimited text.
Scanners normally return delimited text; substituting complementary characters such as `` and '' for " in paragraphs becomes easier when the delimiters are removed.
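A rough Python sketch of the same idea follows. It assumes that paragraphs are separated by blank lines, which the description does not state, and both function names are hypothetical; it is not One-Liner's own code.

```python
import re


def one_line_paragraphs(text: str) -> list[str]:
    """Join each paragraph of CR/LF- or LF-delimited text into one line.

    Assumption: paragraphs are separated by one or more blank lines.
    """
    text = text.replace("\r\n", "\n")               # treat CR/LF like LF
    paragraphs = re.split(r"\n\s*\n", text.strip())  # split on blank lines
    # Collapse the remaining line breaks and runs of spaces to single spaces.
    return [" ".join(p.split()) for p in paragraphs if p.strip()]


def pair_quotes(paragraph: str) -> str:
    """Replace each "quoted span" with ``quoted span''."""
    return re.sub(r'"([^"]*)"', r"``\1''", paragraph)


scanned = 'He said "good\r\nmorning" to her.\r\n\r\nShe nodded.\r\n'
lines = [pair_quotes(p) for p in one_line_paragraphs(scanned)]
```

Because each paragraph ends up on a single line, a quotation that originally spanned a line break can be matched as one unit.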
This is a small Perl command-line script that translates among several languages using online web translators. It can be useful for translating short phrases or words from a shell without a browser.
This is a very simple Perl script that takes a file with unknown line endings (UNIX, Mac, or DOS/Windows) and converts it, by default, to UNIX-style line endings.
This project is aimed at syntax highlighting of code. It is not tied to any particular language and lets users define their own language configurations.
A Java-based spellchecker that focuses on automatic spelling correction by incorporating linguistic and statistical approaches. Development is done by ASV (Abteilung Automatische Sprachverarbeitung) of Leipzig University.
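The conversion described above can be sketched in a few lines of Python. This is an illustration of the technique, not the Perl script itself, and the function name `to_unix_line_endings` is hypothetical.

```python
def to_unix_line_endings(data: bytes) -> bytes:
    """Convert DOS/Windows (CR LF) and classic Mac (CR) endings to UNIX (LF)."""
    # Handle CR LF first so each DOS ending becomes one LF, not two.
    data = data.replace(b"\r\n", b"\n")
    # Any CR still present is a classic Mac line ending.
    return data.replace(b"\r", b"\n")


mixed = b"dos line\r\nmac line\runix line\n"
converted = to_unix_line_endings(mixed)
```

Working on bytes rather than decoded text means the input's character encoding does not need to be known, as long as it is ASCII-compatible.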
SrcComDoc allows documentation written in source comment rows to be extracted, formatted and highlighted according to the chosen documentation format. The basic SrcComDoc syntax is source and documentation language independent.
Adapt is a data conversion language developed in 1984 by Norman W. Molhant and Christophe Dupriez. It has been used in many circumstances, it has translated itself into many programming environments, and it should now evolve toward modern environments like Java.
vie consists of a set of shell scripts that extend the standard UNIX vi editor with simple but useful implementations of popular enhancements found in vi clones, e.g. infinite undo, syntax highlighting, command-line history, or a file explorer.