Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.
Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
Explore 10,000+ tools
Cloud data warehouse to power your data-driven innovation
BigQuery is a serverless and cost-effective enterprise data warehouse that works across clouds and scales with your data.
BigQuery Studio provides a single, unified interface for all data practitioners of various coding skills to simplify analytics workflows from data ingestion and preparation to data exploration and visualization to ML model creation and use. It also allows you to use simple SQL to access Vertex AI foundational models directly inside BigQuery for text processing tasks, such as sentiment analysis, entity extraction, and many more without having to deal with specialized models.
Network software for annotation and quality control of distributed data. Supported by NSF: DBI 0646266 and NSF: DBI 0960535. http://wiki.filteredpush.org
The Common Lisp Reasoner extends the Common Lisp Object System (CLOS) to incorporate a powerful rule language suitable for all kinds of reasoning tasks, vanilla XML and RDF/XML interfaces, and support for a variety of AI-related applications, such as scheduling, planning and diagnosis.
This project is no longer actively maintained. It is succeeded by the Eclipse RDF4J project, which can be found at GitHub and at http://www.rdf4j.org/.
Sesame is a de-facto standard framework for processing RDF data. This includes parsing, scalable storage, reasoning and full SPARQL 1.1 query/update support. Sesame offers a fully modular toolkit and an easy-to-use Java API that can be connected to all leading RDF storage solutions.
Inventors: Validate Your Idea, Protect It and Gain Market Advantages
SenseIP is ideal for individual inventors, startups, and businesses
senseIP is an AI innovation platform for inventors, automating any aspect of IP from the moment you have an idea. You can have it researched for uniqueness and protected; quickly and effortlessly, without expensive attorneys. Built for business success while securing your competitive edge.
Lets you store and query data within the wiki's pages.
Semantic MediaWiki is an extension to the MediaWiki software (powering Wikipedia), which extends the Wiki with ideas from the Semantic Web. We focus on usability and tight integration.
See the web site for further details and GitHub at https://github.com/SemanticMediaWiki/SemanticMediaWiki/releases for the file releases! (since version 2.4.0).
dotNetRDF is a .Net Class Library for working with RDF and SPARQL programatically built with C# 3 for the .Net Framework 3.5/4.0/Silverlight 4 and WIndows Phone 7. It provides a simple object model for representing RDF, SPARQL Queries and Updates and understands all common RDF and SPARQL syntaxes.
Extensive documentation is available on the website and support is available via the mailing lists.
RDFConvert is a simple command-line tool for converting RDF file betweeen different syntax formats. It is based on the OpenRDF Rio parser toolkit, and currently supports RDF/XML, Trig, Trix, Turtle, N3, N-Triples, RDF/JSON, JSON-LD, Sesame Binary RDF and N-Quads.
OntoComP is a Protégé 4 plug-in for completing OWL ontologies
OntoComP is a Protégé 4 plug-in for completing OWL ontologies. It enables the user to check whether an OWL ontology contains "all relevant information" about the application domain, and extends the ontology appropriately if this is not the case. It achieves this by asking the user questions of the form "are instances of classes C1 and C2 also instances of class C3?". If the user confirms such a question, then a new axiom of the application domain that does not follow from the ontology has...
Next-Gen Encryption for Post-Quantum Security | CLEAR by Quantum Knight
Lock Down Any Resource, Anywhere, Anytime
CLEAR by Quantum Knight is a FIPS-140-3 validated encryption SDK engineered for enterprises requiring top-tier security. Offering robust post-quantum cryptography, CLEAR secures files, streaming media, databases, and networks with ease across over 30 modern platforms. Its compact design, smaller than a single smartphone image, ensures maximum efficiency and low energy consumption.
ROO is an (OWL) ontology construction tool based on Protégé 4. ROO makes the ontology building process easier as it provides guidance about the steps involved and it allows to enter knowledge using an easy to learn controlled natural language.
Program to transform any XML document to a RDF one.
Some scenarios for neuroinformatics experiments are saved in standard XML document. This program should be able to parse almost any scenario so far to make a full-valuable RDF file, where every pieces of scenario has their own URI.
This project is from University of West Bohemia in Czech republic, required to Fundamentals of Software Engineering (KIV/ZSWI).
The “RDB2RDF Mapper” is a Java based software desktop application which allows a user to create a mapping file between a relational database and an RDFS/OWL ontology via a graphical user interface. This project is ANDS-funded.
PHPTMAPI is a programming interface for PHP based on the TMAPI project (www.tmapi.org). This API enables PHP developers an easy and standardized implementation of ISO/IEC 13250 Topic Maps in their applications.
SpatiumCube is open source technology for the easily development of Spatial Data Infrastructures (SDIs) and services over them. It includes software, stylesheets, ontologies, and other technology elements for the development of SDIs.
NG4J - 'Named Graphs API for Jena' is an extension to the Jena Semantic Web framework for parsing, manipulating and serializing sets of Named Graphs. For details about Named Graphs see http://www.w3.org/2004/03/trix/
SPARQL to Object Oriented eNgine allows users to expose their traditional data sources (RDBMS, OODBMS, XML, LDAP, ecc) as RDF data, through a SPARQL endpoint.