Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.
Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
Explore 10,000+ tools
Atera all-in-one platform IT management software with AI agents
Ideal for internal IT departments or managed service providers (MSPs)
Atera’s AI agents don’t just assist, they act. From detection to resolution, they handle incidents and requests instantly, taking your IT management from automated to autonomous.
DB2RDF is a a software tool that will convert data from relational data model to semantic data model (in the form of RDF and RDFS). A SPARQL endpoint for querying the converted data. For querying the semantic data, SPARQL query language is used.
The objective of this project is the advancement of Diplomacy in the electronic realm by producing an XML Diplomacy data standard and a suite of server and client tools to facilitate Diplomacy play over the internet.
MACOW is a formal and scalable mandatory access implementation suitable on open worlds such as the provided on Semantic Web, Autonomic Computing and Coaltions and Federations scenarios. It is able to access control on distributed systems.
Creators discontinued Chiba & founded betterFORM http://betterform.de
The development of Chiba was discontinued in 2009.
The betterFORM Project by the same people who did Chiba is its successor. See http://www.betterform.de for further information
e-NVISION, STREP project of the 6th FP (http://www.e-nvision.org/).
This site contains the open-source software developed in the project.
Keywords: e-Business, ontologies, platform, semantic web services, BPEL processes.
XMPP Web Services for Java (XWS4J) is an implementation of machine to machine communication over XMPP. The communicated content is encoded in XML, according to customized definitions of input and output in W3C XML Schemata.
The Java Sitemap Parser can parse a website's Sitemap (http://www.sitemaps.org/). This is useful for web crawlers that want to discover URLs from a website that is using the Sitemap Protocol.
This project has been incorporated into crawler-commons (https://github.com/crawler-commons/crawler-commons) and is no longer being maintained.
Content adaptation platform that adapts internet content to any mobile device in real time. Includes features such as device detection, image transcoding, DOM manipulation, XML/RSS feed support and much more.
OrangeHRM provides a world-class HRIS experience and offers everything you and your team need to be that HR hero you know that you are.
Give your HR team the tools they need to streamline administrative tasks, support employees, and make informed decisions with the OrangeHRM free and open source HR software.
The Little Portal Gizmo is a small Java web application container based on Eclipse plug-ins and the Apache HTTP Core. It uses a custom and very light weight API. The web applications you develop for the Gizmo have the form of Eclipse plug-ins.
The project consist in an opensource implementation of the W3C Recommendation of the XML Key Management Specification 2.0 XKMS 2.0. It is compound of a XKMS Server and a Client API to access to the Server via Web Service.
SnapLogic is an OpenSource Data Integration framework that combines the power of state-of-the-art dynamic programming languages with standard Web interfaces to solve today's most pressing problems in data integration.
JODConverter automates conversions between office document formats using OpenOffice.org. Supported formats include OpenDocument, PDF, RTF, Word, Excel, PowerPoint, and Flash. It can be used as a Java library, a command line tool, or a Web application.
Semantic Web pipes is powerful data level mashup tools based on RDF.Semantic Web pipes are defined in XML and when executed they fetch RDF graphs on the Web, operate on them, and produce an RDF output which is itself accessible via a stable URL.
Meta-Messager: A project that aims to develop a Semantic Web based framework for semantic annotation of B2B message schemas (e.g. XML, EDIFACT, ASN.1) and translation of B2B messages to and from business ontologies.
A servlet (or a bunch of servlets) to access a database, perform ddlutils tasks (retrieve, create and update from or to a description). Uses the apache.org ddlutils (and some other libraries).
OWL2XMI is a java project to facilitate the creation of classes UML diagrams. It generates a XMI file from one ontology, this XMI file can be imported using UML tools such as: StarUML, ArgoUML, among others. There are available online version, Visit Proj
The DITA Open Platform is a free, open-source project which goal is to provide an enterprise platform for the edition, management and processing of DITA documents.
Water: a fast, secure, dynamic OO language and database. Water is an all-purpose language (and meta-language) that runs Web applications in the browser or server-side. Water is compatible with .NET, Java, and C on Windows, Linux and Mac OS X system.
A package that works with Apache Velocity and allows you to build web pages and other content from reusable Velocity template fragments, controlled by hierarchies of layout descriptions in XML.
COLLADA Refinery is a GUI-based application for creating content pipelines for 3D assets using modular components called conditioners. Contact a project admin to become a developer!