Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.
Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
Explore 10,000+ tools
Run applications fast and securely in a fully managed environment
Cloud Run is a fully-managed compute platform that lets you run your code in a container directly on top of scalable infrastructure.
Run frontend and backend services, batch jobs, deploy websites and applications, and queue processing workloads without the need to manage infrastructure.
NEW: Version 1.10.0 is out - Apple Silicon is now natively supported.
dcpTool is a tool for editing DNG camera profiles (DCP files). dcpTool can convert DCP files to and from binary form to an editable XML form, and can also directly do a number of useful transformations to DCP files such as "untwisting" them.
**CODE MOVED TO GITHUB: https://github.com/bitextor **
Bitextor is an application created to generate translation memories using multilingual websites as a corpus source. It downloads an entire website and applies a set of heuristics (based mainly on HTML tag structure and text block length) to find bitexts.
Easy-to-use online form builder for every business.
Create online forms and publish them. Get an email for each response. Collect data.
Easy-to-use online form builder for every business. Create online forms and publish them. Get an email for each response. Collect data. Design professional looking forms with JotForm Online Form Builder. Customize with advanced styling options to match your branding. Speed up and simplify your daily work by automating complex tasks with JotForm’s industry leading features. Securely and easily sell products. Collect subscription fees and donations. Being away from your computer shouldn’t stop you from getting the information you need. No matter where you work, JotForm Mobile Forms lets you collect data offline with powerful forms you can manage from your phone or tablet. Get the full power of JotForm at your fingertips. JotForm PDF Editor automatically turns collected form responses into professional, secure PDF documents that you can share with colleagues and customers. Easily generate custom PDF files online!
bnf2xml a simple BNF parser that takes text as input, searches according to a BNF query file, and outputs text marked up by the xml labels that show context.
bnf2xml is as simple to use as any text binary ie, awk(1) grep(1). bnf2xml does not require C API because it outputs simple xml labeling.
README is visible on file dl page.
EXAMPLE: $ echo "hi" | bnf2xml patternfile
<word><alph>h</alph><alph>i</alph></word>
or
<gas>hydrogen iodide</gas>
patternfile says how to find...
Linknx is an automation platform providing high level functionalities to EIB/KNX installation. The rules engine allows execution of actions based on complex logical conditions and timers. Lightweight design allows it to run on embedded Linux.
The project has been migrated to Github in 2015: https://github.com/linknx/linknx
libxmldiff is a library to provide diff functions for XML files. It is shipped with xmldiff, a simple commandline tool to demonstate libxml functionnalities. See the project xmltreenav for a GUI.
The G(arbage) C(ollected) X(Query) engine is the first streaming XQuery engine that implements active garbage collection, a novel buffer management strategy in which both static and dynamic analysis are exploited.
Open Source Implementations for 3D-Surface Characterisation Algorithms according to ISO 25178 (Geometric Product Specification) in verifiable pseudocode (MATLAB). Implementation of an XML-based file exchange format according to upcoming ISO 25178-72.
Elementary, Middle and High Schools, K-8, K-12, Private, Charter, College Departments and Trade/Technical Schools
Help your school save time, increase enrollment, and achieve its mission with Gradelink. An award-winning student information system and school management system, Gradelink school management software system is suitable for Preschool through High school. Gradelink is a great fit for higher education and for small colleges. Gradelink combines school management, teaching, and learning tools to help schools perform their best. Top features include attendance management, report cards, classes and scheduling, standards-grading system, communications, student information, and student/parent access. Lesson plans, grade sheets, parent communication and custom reports all work together in perfect unison. Gradelink is ideal for K8 schools, private schools and Charter schools.
pdf2xml convertor based on Xpdf library (http://www.foolabs.com/xpdf/home.html). It converts information contained in a PDF file into XML. First, you need to install xpdf and libxml2 (see documentation).
Hervé Déjean
Xerox Research Centre Europe
http://www.xrce.xerox.com/About-XRCE/People/Herve-Dejean
XmlPL is a C like language with special syntax for creating and manipulating XML data. If you know Java, C or C++ and XPath then XmlPL is easy to learn. XML is a native data type and is processed more naturally using XML path expressions and inline XML.
Compiler and interpreter for the Scriptol programming language. Scriptol is object oriented and the first language to use XML as data structure in sources. It is easy to learn and safe. Also, RSS readers in PHP, an RSS feed editor.
XML Processor. A Multi-threaded, Pub/Sub environment for Dynamic programming on an event driven Tickless and Sleeping State Machine with TCP communications, tight flawless memory management, powerful set algebra and a magical database. 100% C++. ezPort.
wsdlpull is an efficient and powerful commandline utility for dynamic inspection and invocation of WSDL web services.It provides a C++ library with dynamic WSDL invocation API,WSDL parser,Schema parser and Validator and an xmlpull Parser/Serializer.
That project aims at providing a framework around the WSDLPull (http://wsdlpull.sourceforge.net) source code, so as to ease the packaging and delivery processes, allowing to deliver packages for a great number of Linux distributions and Unices.
Xemeiah is a fast, modular and scalable XML Framework written in C++, with an efficient DOM and Oasis-compliant XSLT Processor. Xemeiah modules include a persistence layer, a fast Ajax Web Server, a Media Player, ImageMagick frontend, java bindings...
A commandline utility to display statistics about a text file consisting of lines of data. The statistics include counts of line terminator pairs (CR, LF, CR+LF) and line counts. Also shows if there is an unterminated trailing line.
SGML2KSS transforms one SGML document into a XML document that contains the content of the SGML document as well as markup information about the SGML instance such as OmitTag and ShortTag.
PSP RSS Feed Generator is a commandline based PSP RSS Channel file creator intended for beginners' use. It scans a local directory and creates a PSP RSS Channel compatible XML file ready to be hosted on a web site. ***Supports directories with spaces***
xmlconf-lite is a library to read/write XML configure file, it does not depend on any XML SDK, so it is suitable for embed device or when no XML SDK available. Anyway, it is very light and easy to use.
Simple Plain Xml Parser (spxml) is a stream-oriented XML parser that supports pull-model and DOM-model XML parsing.Resulting DOM trees can be read, modified, and saved.
The Introspector enables the programming tools that deal with source code such as the compiler to communicate in a standard and neutral manner reducing the accidental cost of programming. http://github.com/h4ck3rm1k3/