A Perl script that splits a long HTML file into separate inter-linked pages, according to the headings in the original file. Useful for maintaining both a print version and a browsable version of a site.
Tigerleaf's simple XML documents build rich, manageable sites and PDF publications. Tigerleaf eases XML authoring and publishing with versioning, code generation, management and workflow features.
GuitarTeX is based on the idea of Chord. It takes a Chord file containing Chordpro directives to produce good-looking and easy-to-play song sheets for guitarists in PostScript or PDF format.
GNU FriBidi is the Free Implementation of the Unicode Bidirectional Algorithm. GNU FriBidi development has been moved to GitHub. See https://github.com/fribidi/fribidi/
Provides a simple Java .jar file for converting DocBook files to HTML, FO or XHTML and includes all the XSL files needed. Great for cross-platform DocBook conversions and Ant build scripts.
MacBibTk is a Mac-compatible version of Peter Corke's tkbibtex (release 9), a BibTeX file editor and browser. BibTeX is a reference/citation system for use with LaTeX. MacBibTk runs on all platforms with Tcl/Tk ports.
J2ME Memopad is a simple MIDP application designed to allow storage and retrieval of notes. It will have the ability to search and generate a list of results, as well as categorize your memos. The basic design of the memopad is similar to the Palm.
FileExtender is a Perl script that evaluates SQL statements embedded in any kind of text file (including HTML files) and extends those files with the results of the database queries.
SchemaWalker is a Java application that reads any schema and produces XForms web pages for user-selected nodes, grouped into web pages, to allow editing of XML data files.
This project will provide tools for users to convert existing web sites, blogs and documents with non-standard Myanmar font data to Unicode 5.1 compatible data.
(Zawgyi to Unicode 5.1, WinMyanmar system to Unicode 5.1 etc.)
Open-Tamil is a full-featured Tamil text processing library in Python. It works fully in Python 2 and 3.
Published on the Python Package Index (PyPI) and installable with pip.
See: https://pypi.python.org/pypi/Open-Tamil/0.67
Adapt is a data conversion language developed in 1984 by Norman W. Molhant and Christophe Dupriez. It has been used in many circumstances, it has translated itself into many programming environments, and it should now evolve toward modern environments like Java.