Crawl websites, sync to vector databases, and power RAG applications. Pre-built integrations for LLM pipelines and AI assistants.
Build data pipelines that feed your AI models and agents without managing infrastructure. Crawl any website, transform content, and push directly to your preferred vector store. Use 10,000+ tools for RAG applications, AI assistants, and real-time knowledge bases. Monitor site changes, trigger workflows on new data, and keep your AIs fed with fresh, structured information. Cloud-native, API-first, and free to start until you need to scale.
Try for free
Skillfully - The future of skills based hiring
Realistic Workplace Simulations that Show Applicant Skills in Action
Skillfully transforms hiring through AI-powered skill simulations that show you how candidates actually perform before you hire them. Our platform helps companies cut through AI-generated resumes and rehearsed interviews by validating real capabilities in action. Through dynamic job specific simulations and skill-based assessments, companies like Bloomberg and McKinsey have cut screening time by 50% while dramatically improving hire quality.
Bookmark sync tool for ebook readers with xml bookmark system. Primarily focussed on devices based on 4FFF N618 ebook reader, wich uses Adobe Digital Editions for content sync but have no bookmark sync.
Lightweight XML processor; XML-POJO mapping via Java5 annotations or DTD; Preprocessing of XML documents using expression language; Binary XML; RMI friendly XML; JSON format support; XML marshall/unmarshall; HTML as XML parser; Swing XML Viewer
HTTP functional and non-functional (load and performance) toolkit based on jython/grinder (http://grinder.sf.net) ...includes capabilities to support: SOA services, REST, json/xml encoding, AES and WS security ... and a stub to collect requests
PHP web API designed to simplify object handling(loading, saving, querying, displaying, and editing), abstract the data from its display structure, and layout and allow the target data to be delivered to any supported format without special logic.
The London Datastore (http://data.london.gov.uk) was created by the Greater London Authority (GLA) as an innovation towards freeing London’s data. This SourceForge Project will be used to Open Source our development efforts surrounding data formats
XMUltra is a Java and XML based feed-processing framework. It features utilities for many common tasks, such as:
*FTPing files
*Zipping files
*Watching directories
*XSL transformation
*Cleaning out archives
This project defines a walk-up-and-use, wiki-like experience designed to encourage the widest possible deployment of basic DITA writing capability across new communities of users, whose requirements can help drive additional DITA tools and services.
A processor for xml-transformations based on a processing model, which allows to define the transformations by stepwise refinement, in structured (imperative) programming languages and with respect to a given xml-target language.
Secure and customizable compute service that lets you create and run virtual machines.
Computing infrastructure in predefined or custom machine sizes to accelerate your cloud transformation. General purpose (E2, N1, N2, N2D) machines provide a good balance of price and performance. Compute optimized (C2) machines offer high-end vCPU performance for compute-intensive workloads. Memory optimized (M2) machines offer the highest memory and are great for in-memory databases. Accelerator optimized (A2) machines are based on the A100 GPU, for very demanding applications.
This is a generic XML to RDF converter which uses XSLT transformations to convert any XML document into RDF format.
The transformation uses an XSLT processor like xsltproc. The command line for the Bash shell is:
xsltproc xml2rdf3.xsl document.xml > document.rdf
Reference:
Breitling, F. 2009: A standard transformation from XML to RDF via XSLT, Astronomical Notes, Vol 330 Issue 7, DOI: 10.1002/asna.200811233,
http://onlinelibrary.wiley.com/doi/10.1002/asna.200811233/abstract...
A framework for all-platform GUI-development based on completely separating business from presentation logic and introducing methods of abstraction that allow stepwise refinement of the GUI-definition.
This is a project aimed to create a most unified cross-language japanese dictionary out there.
So far project includes:
* xKanjiDic - eXtended Kanji Dictionary
* xJDict - eXtended Japanese Dictionary (XML and others in developement)
CpX is an XML-based lightweight C++ development environment.It’s a philosophy of simple OO software engineering, with a C++ subset & base classes.It has general-purpose useful C++ services & tasks to speed up build.Tired? A Sudoku game is given in CpX as a bonus.
A Java-based in-application help system that can read multiple different documentation file formats, producing professional help systems from multiple sources. Documentation included for testing is from the jAudio project under the latest LGPL licens
CuppaWEB is an open source enterprise content management system (CMS) built by CuppaIT. Its modular architecture uses the latest open source Java/Ajax technologies.
XMP PHP Toolkit Extension is a PHP module which include the Adobe XMP Toolkit SDK. The main functions from adobe XMP will be available from PHP as classes and methods. The actual release 2.0 is based on the new Adobe XMP Toolkit SDK 5.1.2.
Given an XSD schema and an XSA.xml configuration file, XML Skeleton Annotations (XSA) generates JSF forms UI to create XML records compliant to the XSD and following the XML skeleton defined in XSA.xml, still keeping everything under your control.
Mistral-IdM is a project whose aim is to provide an identity management system, with advanced authentication and authorization abilities, based on standards (SAML, XACML, XKMS), providing a user-friendly administration console.
now here: https://github.com/plastex/plastex
plasTeX is a Python-based LaTeX document processing framework. It gives DOM-like access to a LaTeX document, as well as the ability to generate mulitple output formats (e.g. HTML, DocBook, tBook, etc.).
XPN is relies on a non relational native XML database, where XML documents are stored in a compressed form and indices enables fast access to structure and content, thus enabling a fast evaluation of XQuery queries.
A Qt-based Application used to collect geographical information from twitter accounts and display them using Google Maps API. It will also include features to follow links and extract similar information from image sites like Flickr and Dailybooth.
Trial Criteria Online Data Entry (trialCODE): a Java-based user interface that codifies eligibility requirements used to automate the screening of potential subjects to clinical trials. Used for caMATCH screening engine on BreastCancerTrials.org site.
The gateway is an open source JavaEE application developed by the Vermont Dept of Taxes. It provides a web services framework for accepting Streamlined Sales Tax registrations and returns. It also includes a web interface for submitting transmissions.
XML Processor. A Multi-threaded, Pub/Sub environment for Dynamic programming on an event driven Tickless and Sleeping State Machine with TCP communications, tight flawless memory management, powerful set algebra and a magical database. 100% C++. ezPort.