Crawl websites, sync to vector databases, and power RAG applications. Pre-built integrations for LLM pipelines and AI assistants.
Build data pipelines that feed your AI models and agents without managing infrastructure. Crawl any website, transform content, and push directly to your preferred vector store. Use 10,000+ tools for RAG applications, AI assistants, and real-time knowledge bases. Monitor site changes, trigger workflows on new data, and keep your AIs fed with fresh, structured information. Cloud-native, API-first, and free to start until you need to scale.
Try for free
Total Network Visibility for Network Engineers and IT Managers
Network monitoring and troubleshooting is hard. TotalView makes it easy.
This means every device on your network, and every interface on every device is automatically analyzed for performance, errors, QoS, and configuration.
A LISP-like XML glue language with an XML syntax.
Ideal for pipelined XML aggregation, transformations
and filtering with accessors to a content repository.
Embeddable Java implementation includes XSLT engine XT,
servlet, command line and applet.
jGetFile is a command-line scriptable recursive file downloader for the web. Where other downloaders fail, jGetFile succeeds in downloading the files you want with simplicity and ease of use.
It's a modern take on desktop management that can be scaled as per organizational needs.
Desktop Central is a unified endpoint management (UEM) solution that helps in managing servers, laptops, desktops, smartphones, and tablets from a central location.
Indexer of FTP sites so that the sites are searchable from a central location. Credentials of sites and the data are stored in an SQL database. No front-end is included at this time.
Automatically generates echo2 applications from a domain model (scaffolding). Goal is to integrate with ide plugins to provide a quick process for getting a database driven Echo2 AJAX application running.
Cross-platform searchable CD-ROM. Vicaya is a search engine and indexing tool for use on a local file system or CDROM, written in Java and based on Apache Nutch, and Tomcat. The goal is to replicate a website on a CD-ROM to be used on any platform.
HtmlClient provides an SGML/HTML/XHTML parser and connection client making web-spidering as easy for developers as actually surfing the web with a premade browser. Based on Apache's HttpClient.
Run applications fast and securely in a fully managed environment
Cloud Run is a fully-managed compute platform that lets you run your code in a container directly on top of scalable infrastructure.
Run frontend and backend services, batch jobs, deploy websites and applications, and queue processing workloads without the need to manage infrastructure.
Scopes is a new way to use Java EE web application scopes that provides a web-platform-independent state object representing all web contexts and an extensible framework to create new scopes besides "request", "session" and "application"
WebTheme is a Java-based presentation framework for developing reusable web components called themes. Themes are composed of JSP layout views and CSS skins. It is simpler to use and more object-oriented than similar frameworks such as Tiles and SiteMesh.
Client-side and client/server AJAX applets, games, web widgets, plugins based on the Google Web Toolkit for use in your websites, blogs, CMS systems, and for display in Internet Explorer, Mozilla, Firefox, Safari, Opera.
A modular (eXtensible) FTP Server written in Java, providing the ability to customise what happens to the FTP requests using JavaBean "Handlers" configured via SpringFramework. This allows almost unlimited customisations to be easily plugged in
Denebola is a Java EE view template system, which works like DHTML in server-side. Javascript (and other scripting languages) is used to manipulate 100% pure (X)HTML templates via CSS2 Selector style element filters to produce data-driven web pages.
Helona is a project to provide xml based plugins (modules) and themes for Apache cocoon, forrest and lenya based web applications. The project is created to provide code that are not meeting ASF policies (e.g. including GNU LGPL licenced code).
Hoople is like attribute-oriented programming for URLs. Rather than having the configuration information and “URL Logic” spread throughout the site, you create a single XML configuration file that contains all the “logic” for each URL on the file
QuickWCM is a Web Content Manager (WCM) with a very easy to use web-based interface, seamless security model, integrated search engine and more. QuickWCM runs on JSR-170 repository and is easy to extend with JSR-168 portlets.
Controls the lifecycle of portlet-like JSP components, does not require to install a portal engine. Components are either reloaded using traditional synchronous HTTP request/response cycle (Non-Ajax mode), or updated in-place (Ajax mode).
GWT-Components aims to collect contributions that extend and enhance the Google Web Toolkit (GWT). The contributions could potentially add features (such as drag-and-drop, for example), components, and integration with existing JavaScript libraries.
The UDDI Browser project aims to provide a leading Open Source, pure Java-based application for the querying and manipulation of both public and private UDDI registries.
A dvd library with content management system (CMS) it will works with a self-written compact HTTP Server (no Apache, IIS, etc.).
It's a semester project by two students at FH Coburg (Germany).