Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.
Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
Explore 10,000+ tools
Run applications fast and securely in a fully managed environment
Cloud Run is a fully-managed compute platform that lets you run your code in a container directly on top of scalable infrastructure.
Run frontend and backend services, batch jobs, deploy websites and applications, and queue processing workloads without the need to manage infrastructure.
The EpiDoc Collaborative is developing specifications and tools for standards-based, digital publication and interchange of scholarly and educational editions of documentary and literary texts like inscriptions and papyri.
The link below will take you to the EpiDoc home page on this site.
Canorus is a free cross-platform music score editor. It supports an unlimited number and length of staffs, polyphony, a MIDI playback of notes, chord markings, lyrics, import/export filters to formats like MIDI, MusicXML, ABC Music, MusiXTeX and LilyPond
Cookbook Style Document for DocBook Customizations
This project has been moved to GitHub: https://github.com/tomschr/dbcookbook/
The DoCookBook project aims to create an open source book about DocBook and the DocBook XSL stylesheets written as a cookbook and released under a Creative Commons license.
A Python script that can be used to get information on TV shows and Movie Shows from thetvdb.org and themoviedb.org. This is an learning experience and anybody can chime in on everything.
It's a modern take on desktop management that can be scaled as per organizational needs.
Desktop Central is a unified endpoint management (UEM) solution that helps in managing servers, laptops, desktops, smartphones, and tablets from a central location.
Epicxml is an xql like command line interpretor for xml files management. Used as command: Epicxml is a shell friendly, xpath like query tool, wich allows you to navigate, find/create/update/delete/print nodes. It is based on picxml.
pyxser stands for python xml serialization and is a python object to XML serializer that validates every XML deserialization against the pyxser 1.0 XML Schema. pyxser is written entirely in C as a python extension.
HTTP functional and non-functional (load and performance) toolkit based on jython/grinder (http://grinder.sf.net) ...includes capabilities to support: SOA services, REST, json/xml encoding, AES and WS security ... and a stub to collect requests
The London Datastore (http://data.london.gov.uk) was created by the Greater London Authority (GLA) as an innovation towards freeing London’s data. This SourceForge Project will be used to Open Source our development efforts surrounding data formats
now here: https://github.com/plastex/plastex
plasTeX is a Python-based LaTeX document processing framework. It gives DOM-like access to a LaTeX document, as well as the ability to generate mulitple output formats (e.g. HTML, DocBook, tBook, etc.).
OrangeHRM provides a world-class HRIS experience and offers everything you and your team need to be that HR hero you know that you are.
Give your HR team the tools they need to streamline administrative tasks, support employees, and make informed decisions with the OrangeHRM free and open source HR software.
C4me aims to provide a convenient way of editing XML files (and, in a distant future, even more modding-related files) for modifications for Sid Meier's Civilization 4. It's now in its infancy and not really usable - join and help changing that!
This is a pure-Python XPath evaluator based on ElementTree. It supports a substantial fraction of the XPath 1.0 specification, but only the self, child, and attribute axes. The parser underlying the evaluator attempts to handle all of XPath 1.0.
Banking and Retail online transaction processing platform. Software is developed utilizing ACE (Adaptive Communications Environment) which is a multi-platform middleware that handles communications, events and multi-threading. C-Tree database.
Software to translate the geographical format SOSI into KML or other spatial data formats. SOSI is the Norwegian National format for exchange of spatial data.
sabnzbd-xmlgui is an Ajax based frontend built around sabnzbdplus. It also provides an xml based API for other applications to easily connect with sabnzbd while at the same time maintaining the existing web based ajax gui.
This project will provide translation of mathematical content, from TeX to MathML and vice-versa, and to graphics formats, as a web service. TeX, running as a daemon, is used for mathematical typography.
4Suite is a platform for XML processing and knowledge-management, consisting of a library of integrated tools for XML processing, and an XML data repository and server with a rules-based engine.
wxBrowser is an application browser based on the wxWidgets GUI framework. It's similar to a regular old web browser only, instead of reading HTML and displaying content it reads XML and executes presentation logic (wxPython) in a client side application.
Image2DocInfo has been made to quickly tag digital pictures. A GUI allows you to set attributes for an image, and then store them in XML files. Those files follow the Dublin Core naming scheme and are stored in the same directories than the pictures.
Splice is a Python-based content aggregation and publishing platform. It provides all of the features of a common weblog combined with synchronization capabilities, allowing content to be slurped in from external sources, classified, and published.