Crawl websites, sync to vector databases, and power RAG applications. Pre-built integrations for LLM pipelines and AI assistants.
Build data pipelines that feed your AI models and agents without managing infrastructure. Crawl any website, transform content, and push directly to your preferred vector store. Use 10,000+ tools for RAG applications, AI assistants, and real-time knowledge bases. Monitor site changes, trigger workflows on new data, and keep your AIs fed with fresh, structured information. Cloud-native, API-first, and free to start until you need to scale.
Try for free
Atera all-in-one platform IT management software with AI agents
Ideal for internal IT departments or managed service providers (MSPs)
Atera’s AI agents don’t just assist, they act. From detection to resolution, they handle incidents and requests instantly, taking your IT management from automated to autonomous.
Joomla! component to achieve a semantic extensión of the WCMS It converts the infrastructure of Joomla! into the semantically equivalent. It allows both manual and automatic semantic annotations as well as enhanced search capabilities.
This library provides a Ruby interface to Nmap's scan data. This information is presented in an easy-to-use and intuitive fashion for storage and manipulation.
PHP web API designed to simplify object handling(loading, saving, querying, displaying, and editing), abstract the data from its display structure, and layout and allow the target data to be delivered to any supported format without special logic.
The London Datastore (http://data.london.gov.uk) was created by the Greater London Authority (GLA) as an innovation towards freeing London’s data. This SourceForge Project will be used to Open Source our development efforts surrounding data formats
Trial Criteria Online Data Entry (trialCODE): a Java-based user interface that codifies eligibility requirements used to automate the screening of potential subjects to clinical trials. Used for caMATCH screening engine on BreastCancerTrials.org site.
Total Network Visibility for Network Engineers and IT Managers
Network monitoring and troubleshooting is hard. TotalView makes it easy.
This means every device on your network, and every interface on every device is automatically analyzed for performance, errors, QoS, and configuration.
The gateway is an open source JavaEE application developed by the Vermont Dept of Taxes. It provides a web services framework for accepting Streamlined Sales Tax registrations and returns. It also includes a web interface for submitting transmissions.
XTRACT4J V2 is a stand-alone, pure-Java program which creates XML file by dependent or independent SQL queries. It is designed as a drop-in replacement for Oracle Report to generate XML file. It also incorporates BI Publisher to create PDF reports.
The goal of the OSM2PostGIS project is to develop an OpenStreetMap data processor that can import any amount of .osm XML data into a PostGIS database, and produce evaluated navigation network topologies to support routing.
This is MySQL Proxy service written in PHP. All outputs of the queries are represented as XML data. This service can be placed in the DMZ of your server. Connect your application data layer to this service. Example in C# is bundled in the archive.
Laika analyzes and reports on the interoperability capabilities of EHR systems. This includes the testing for certification of EHR software products and networks.
PgAtomServer is an open source Atom Publishing Protocol server based on PHP and PostgreSQL.
Inspired by RFC 5023, PgAtomServer aims to manage CRUD operations on Web resources through HTTP, XML and ReST. Data is stored in a PostgreSQL database.
A suite of emergency notification tools using the Common Alerting Protocol format for messages. Includes a web interface for posting, administration, and public viewing, public and monitored alert monitors and a mass SMS / email sender.
PHP libraries which create an XML-based API for storing data in (and retrieving it from) a database. API allows for one or multiple nodes to push/pull data from one or multiple stores, each with one or multiple items allowed.
The CampusSource Engine Integration Platform (CSE-IP) supports the composition of an integrated information managment in heterogeneous IT environments of educational institutions. It realises a process-oriented link-up of campus information systems.
Gemibloo is a fully open and very extensible Geo Micro blogging platform thanks to its import and export plugins. Geolocation is extracted from pictures sent by mail from mobile phones, displayed on a map, and used to update Twitter & Fire Eagle.
A servlet (or a bunch of servlets) to access a database, perform ddlutils tasks (retrieve, create and update from or to a description). Uses the apache.org ddlutils (and some other libraries).
Sushee is an XML Office Management Framework: a set of application development tools designed to manage contents and activities (companies, institutions, associations, etc.) in a multi-language, multi-channel, multi-format and multi-project context.
Interoperability enabling tools, for exchanging technical data between companies, or for internal data integration, alpha-numeric, 2D or 3D data. Compatible with ISO 15926 and expressed in ontology using RDF/OWL.
Send XMPP (Gtalk/Google Talk also supported) messages (thanks to Jabber) within a single line of PL/SQL code, get buddies list and status in relational tables (or xml) on Oracle 11g.