Crawl websites, sync to vector databases, and power RAG applications. Pre-built integrations for LLM pipelines and AI assistants.
Build data pipelines that feed your AI models and agents without managing infrastructure. Crawl any website, transform content, and push directly to your preferred vector store. Use 10,000+ tools for RAG applications, AI assistants, and real-time knowledge bases. Monitor site changes, trigger workflows on new data, and keep your AIs fed with fresh, structured information. Cloud-native, API-first, and free to start until you need to scale.
Try for free
Easy-to-use Business Software for the Waste Management Software Industry
DOP Software’s mission is to streamline waste and recycling business’ processes by providing them with dynamic, comprehensive software and services that increase productivity and quality of performance.
Bugkilla is a set of java tools for the functional test of J2EE Web Applications.
Specification and execution of tests will be automated for web front end and business logic layer.
One goal is to integrate with existing frameworks and tools.
SiteEngine is a Servlet based web subsystem purposed to simplify of managing of site content (but it is not a Content Management System, it is just a lib).
As a healthcare provider, you should be paid promptly for the services you provide to patients. Slow, inefficient, and error-prone manual coding keeps you from the financial peace you deserve. XpertDox’s autonomous coding solution accelerates the revenue cycle so you can focus on providing great healthcare.
This will be a PHP based website system. It will have an xml database for all content (no need for database software.) Also it will have the ability to use a template system that can easily accomodate flash and WYSYWIG tpl.
The Haskell Web Publisher shall allow website implementation using the functional programming language Haskell. Thereby, accuracy of URIs, data validity, and compliance to security restrictions shall be assured by compiler checks.
Memephage is an automated web log (blog). It passively gathers and summarizes links from various places. Currently: IRC, social MUDs, e-mail, and web browsers. Uses the POE multitasking and networking framework for Perl.
Plan smart spaces, connect teams, manage assets, and get insights with the leading AI-powered operating system for the built world.
By combining AI workflows, predictive intelligence, and automated insights, OfficeSpace gives leaders a complete view of how their spaces are used and how people work. Facilities, IT, HR, and Real Estate teams use OfficeSpace to optimize space utilization, enhance employee experience, and reduce portfolio costs with precision.
HyperSpider (Java app) collects the link structure of a website. Data import/export from/to database and CSV-files. Export to Graphviz DOT, Resource Description Framework (RDF/DC), XML Topic Maps (XTM), Prolog, HTML. Visualization as hierarchy and map.
W3mon is a tool that can be used to monitor a set of web sites.
By continually polling such web sites (say, once per hour), the
respective webmasters can be notified by e-mail whenever an outage
occurs.
Protect your bandwith and increase traffic to your vBulliten web site.
This project is to keep the location of files a secret that are on FTP of a vBulliten forums website. Avoid using the vB attachments database and hide file locations.
LCat is a php/mysql cataloging system for hyperlinks, intended for use on websites. It is build around a set of objects providing the basic functionality and complemented with a set of scripts and templates to perform communication with the user.
Orome is a tool for automating System or Acceptance tests (also Unit test though this is not the focus) for web-based systems. Orome takes a set of static HTML pages defining a walkthrough of (part of) the systems and tests it against the running system.
aWebVisit analyses web logfiles for visit information like: entry/transit/exit points, internal links followed, length of visit and time spent.
aWebVisit-Map then allows you to examine the links followed by your visitors to and from each web page...
This Linklist uses a MySQL backend. It can handle mutiple Linklists in only 3 tables. Every Linklist has also Categories and Moderator functions. The Admin page has rich amount of features including easy un-/install. The design is clear but cool.
RoboWeb is set of tools to automate the creation of test suites for any kind of web sites and web applications.
RoboWeb uses a proxy recorder to capture user actions as she navigates a site, which later can be reproduced automatically. Site responses
TightURL takes a very long URL as input, and returns a very short URL.
Example (long URL)
http://service.domain.net/de/cgi/g.fcgi/startpage?site=greetings&CUS-GHTOO=43
Exampe (short URL)
http://www.yourdomain.com/174733788
phpSitemapNG is a free Google Sitemaps generator written in PHP, but also generates RSS-based, txt-based and HTML-based sitemap files. It will spider your website and can also index the filesystem.
Programa escrito em PHP para download e upload de arquivos em FTPs e servidores grátis, download de Torrents e verificação de status de links em servidores grátis.