Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.
Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
Explore 10,000+ tools
Dun and Bradstreet Risk Analytics - Supplier Intelligence
Use an AI-powered solution for supply and compliance teams who want to mitigate costly supplier risks intelligently.
Risk, procurement, and compliance teams across the globe are under pressure to deal with geopolitical and business risks. Third-party risk exposure is impacted by rapidly scaling complexity in domestic and cross-border businesses, along with complicated and diverse regulations. It is extremely important for companies to proactively manage their third-party relationships. An AI-powered solution to mitigate and monitor counterparty risks on a continuous basis, this cutting-edge platform is powered by D&B’s Data Cloud with 520M+ Global Business Records and 2B+ yearly updates for third-party risk insights. With high-risk procurement alerts and multibillion match points, D&B Risk Analytics leverages best-in-class risk data to help drive informed decisions. Perform quick and comprehensive screening, using intelligent workflows. Receive ongoing alerts of key business indicators and disruptions.
MacBibTk is a Mac compatible version of Peter Corke's tkbibtex (release 9), a BibTeX file editor and browser. BibTeX is a reference/citation system for use with LaTeX. MacBibTk runs on all platforms with Tcl/Tk ports.
FileExtender is a Perl script to evaluate embedded SQL statements in any kind of text file (incl. HTML files) and extends these files with results from the database queries.
This tool is designed to help break a project down into smaller and smaller chunks, allowing you to go into fine detail without losing sight of the big picture.
Particularly good for certain types of dyslexia.
It is a tool that converts source codes(such as c/c++) directory tree
to XML files tree.It also support easy-done plugins for free your-designed
source code converting method!
Realistic Workplace Simulations that Show Applicant Skills in Action
Skillfully transforms hiring through AI-powered skill simulations that show you how candidates actually perform before you hire them. Our platform helps companies cut through AI-generated resumes and rehearsed interviews by validating real capabilities in action. Through dynamic job specific simulations and skill-based assessments, companies like Bloomberg and McKinsey have cut screening time by 50% while dramatically improving hire quality.
A Lightweight XML DTD and Style-Sheets for the structuring of Resumes and Job Listings. The purpose is to implement a simple and balanced data exchange format between job seekers, employers, recruiters, and listing services.
This project is ON HOLD at
Collection of tools to convert data from Windows Treepad files to Unix Yank format and vice versa. The following tools are available:
hjt2yank,
yank2hjt,
and hjtyank windows frontend.
For companies looking to automate their consolidation and financial statement function
The software is cloud based and automates complexities around consolidating and reporting for groups with multiple year ends, currencies and ERP systems with a slice and dice approach to reporting. While retaining the structure, control and validation needed in a financial reporting tool, we’ve managed to keep things flexible.
NEchoes has been superseded by n-spell: http://nedit.gmxhome.de/n-spell.html. NEchoes is a package for the "NEdit" editor that enables it to use the "Aspell" spell-checker to highlight mistakes continuously as you type, and help you correct them.
This project is to develop a tool written in PERL which will preprocess HTML files containing special directives written as HTML comments to create a document level index of sections and section level indices of subsections.
This is a small perl command-line script that translates among several languages, using on-line web translators. It can be useful to translate small phrases or words from a shell without any browser.
This is a very simple Perl script that will take a file of unknown line endings UNIX, MAC, or DOS/Windows, defaulting converting the file to the UNIX style line endings.
When translating becomes a game ! Text to translate can be graphically selected. Several dictionnaries can be sorted according to the context. A large choice of matching strategies is available. The OCR engine is tunable.
Open-Tamil is a full featured Tamil text processing library in Python. It works fully in Python 2, 3.
Published via pip - python package index.
See: https://pypi.python.org/pypi/Open-Tamil/0.67
The SchemaWalker is a Java application able to read a any schema and produce XForms web pages for user selected nodes grouped into webpages to allow editing of XML data files.
JFindReplace is a java based find and replace utility. It allows you to select files and folders. You can replace text in several thousands of files in one click. It supports regular expressions and have a neat UI.
A Java-based spellchecker which focuses on automatic spelling correction by incorporating lingustic and statistical approaches. Development is done by ASV (Abteilung Automatische Sprachverarbeitung) of Leipzig University.
A little tool which provide opportunity to manage data stored in system clipboard. This project uses java as a language. And don't depends on OS, desktop environment etc...
TX is a free and opensource cross-platform Computer Assisted Translation tool with translation memory, full and partial matches, machine translation, glossaries, search engine, support for various document types, and translation projects.
Project is aimed at syntax highlighting of code. It is not strictly oriented for any concrete language and enables user to define own language configuration.