Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.
Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
Explore 10,000+ tools
Fully managed relational database service for MySQL, PostgreSQL, and SQL Server
Focus on your application, and leave the database to us
Cloud SQL manages your databases so you don't have to, so your business can run without disruption. It automates all your backups, replication, patches, encryption, and storage capacity increases to give your applications the reliability, scalability, and security they need.
Apolda is a plugin for the Gate framework (see http://sourceforge.net/projects/gate/) that annotates texts with labels of concepts from an arbitrary OWL-ontology.
AppleScripts for integrating BibDesk with Microsoft Word in Mac OS X. They allow non-destructive creation of bibliographies that draw their data from BibDesk publications. The aim is to approach the Microsoft Word integration of other software.
A simple package for OpenOffice, consisting of a document template and a set of macros intended to simplify the task of formatting stage plays, and to provide the playwright with an environment which promotes rapid writing.
As a healthcare provider, you should be paid promptly for the services you provide to patients. Slow, inefficient, and error-prone manual coding keeps you from the financial peace you deserve. XpertDox’s autonomous coding solution accelerates the revenue cycle so you can focus on providing great healthcare.
Make your file PDF with one interface friendly, all is visual, only you have to click and make your own PDF, is totally free, is make in JAVA. You can make PDF profesional with this tool.
Applies a simple SQL query (without joins) to a tab-separated-value text file. Intended for the command line, like "txtselect 'c2 from states.txt where c3 > 1000000'." Distributed as a single Python source-code file.
TagParser is a java parser based on CSS formulas (like JQuery) and can parse any documents based on tags such as XML, HTML. Furthermore, it doesn't require documents to be well formed and can parse complex documents with embedded scripts or CSS parts
DAT Freight and Analytics operates DAT One truckload freight marketplace
DAT Freight & Analytics operates DAT One, North America’s largest truckload freight marketplace; DAT iQ, the industry’s leading freight data analytics service; and Trucker Tools, the leader in load visibility. Shippers, transportation brokers, carriers, news organizations, and industry analysts rely on DAT for market trends and data insights, informed by nearly 700,000 daily load posts and a database exceeding $1 trillion in freight market transactions. Founded in 1978, DAT is a business unit of Roper Technologies (Nasdaq: ROP), a constituent of the Nasdaq 100, S&P 500, and Fortune 1000. Headquartered in Beaverton, Ore., DAT continues to set the standard for innovation in the trucking and logistics industry.
JSpell is a Java API for providing text-checking functionality (spell checkers, grammar checkers, style checkers etc.) to Java programs. It is designed to support all text-checking engines via a plugin architecture, and to be as easy to use as possib
A 100% Java client for the DICT protocol (RFC2229). This provides access to lexicons, translating dictionaries, thesauri and similar database over a TCP/IP protocol.
AnissTranslator is the open source development of Hebrew and spoken arabic online translation. It is devleoped by the peace-seeking Aniss organization and intended to be used by web applications or as a standalone php package
otl is a text processor for generating markup from plain text. Much of both the input and output formats can be customized. otl supports structures such as nested ordered lists, headers and footers, and tables.
TextMarker is now developed and hosted at Apache UIMA (http://uima.apache.org/textmarker.html). TextMarker is a UIMA-based tool for information extraction and more. The full featured editor of the rule language and the build process of UIMA descriptors are complemented with components for visualization, explanation, testing and rule learning.
Camomile is a Unicode library for ocaml. Camomile
provides Unicode character type, UTF-8, UTF-16, UTF-32 strings,
conversion to/from about 200 encodings, collation and locale-sensitive
case mappings, and more.
Plugins for Firefox and Google Chrome that automates usage of „Typograf“ service hosted at http://www.artlebedev.ru/tools/typograf/. Plugin takes text from any text area in Firefox and processes it according to typographic rules (e.g. inserts typ
BonGoLipi (Bong-Go-Lipi) is a transliteration tool to convert phonetically typed Bengali (Bangla) into text displayable with Unicode or non-Unicode fonts. It supports different transliteration schemes. The objective is to propagate standardized Bengali.
A simple way to create a syntax highlighting editor for a custom language/grammar and/or create custom grammar parsers. This is a .NET project written in C#. See details here: http://acct001.com/wordpress/?p=190