Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.
Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
Try It Free
Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure
Native application identity and user-based security for your Azure cloud
Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
VuFind® is a library resource discovery portal designed and developed for libraries by libraries. The goal of VuFind® is to enable your users to search and browse through all of your library's resources by replacing the traditional OPAC.
Web data extraction (web data mining, web scraping) tool. It leverages well proved XML and text processing techologies in order to easely extract useful data from arbitrary web pages.
panFMP is a generic framework suitable for harvested XML metadata that is searchable through Apache Lucene without any additional RDBMS. Fields can be defined by XPath allowing for full text queries on all types of fields including numerical ranges.
The code was moved to Github: https://github.com/pangaea-data-publisher/panfmp
Framework for search and display of heterogenous document collections.
NOTICE: This code repository is deprecated. Please visit https://github.com/cdlib/xtf for the latest updates.
Obsolete Description: The eXtensible Text Framework (XTF) is an architecture that supports searching across collections of heterogeneous textual data (XML, PDF, HTML, text, and more), and the presentation of results and documents in a highly configurable manner. Includes highly customized versions of the proven open-source components Lucene and Saxon.
Digital Learning Sciences (DLS) is a mission-centered, not-for-profit organization dedicated to improving learning through the use of digital content and tools.
...webStraktor features a scripting language to facilitate the collection, the extraction and the storage of information available on the web, including images. The scripting language uses elements of the Regular Expression and xPath syntax. The webStraktor scripting language has a small instruction set and its syntax is easy to master.
The standard webStraktor output format is XML based, either in ASCII, UTF-8 or ISO-8859-1 (Latin1) code pages.
webStraktor relies on the Apache HttpClient for retrieving content via the HTTP protocol. It adheres to the Robots Exclusion Protocol and it can be configured to operate in an anonymous way by connecting to the predominant types of web proxy servers.
...
Data migration/conversion library based on STX and XSLT transformation
Infofuze is a Java library and server application that can be used to transform and combine data from various sources into a specific XML or other text output format that can be stored or indexed.
Syndicateme.net ... Ajax Atom 1.0 Syndication Engine Tell your story ... Especially if you are a business along Queen St. in Toronto Canada or King Street Waterloo Canada. Syndication can be from a pop mailbox, and can use XInclude.
Xcerpt is a Query and Transformation Language for XML and Semistructured data. Instead of the navigational approach of XPath-based languages like XSLT or XQuery, Xcerpt uses patterns for querying and is based on concepts of logic programming like unifica
Deploy in 115+ regions with the modern database for every enterprise.
MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
HXPath is a command line tool useful to extract data from HTML documents. HXPath can select sub trees, like the standard xpath tool, but is also able to read contents and attributes and output them in a bash friendly format. HTML Tidy and HTTP/HTTPS get are built in too.
Lurker is a mailing list archiver designed for capacity, speed, simplicity, and configurability in that order.
Noteworthy features include: google-style searching on all fields, chronology preserving threads, multilingual, and attachment support.
A semester project for SENG 513 at the University of Calgary.
This is an online web-reservation system that will allow a user to make reservations at their favorite restaurant.
x3blog is multiuser weblog system,based on xml/xsl and ajax of browse,DIV+CSS layout measure up web2.0,written in ASP.NET(C#), many database be supported,Lucene.net 2.1 and optimize chinese analyzer by the fulltext search engine. http://blog.muchool.com
Download multiple job postings in XHTML for batch browsing. Can also be input into programs you write to screen, weight, sort, archive, analyse job requirements etc. Currently supports http://www.jobbank.gc.ca
"Oracle Safe Search" is "Google Search" that does **NOT** display the results from the www.dba-oracle.com web site. If you are an Oracle DBA, this Firefox and MSIE plug-ins will save your time if not an even better result !
open-search is a framework to build a p2p web search engine, whereby people mutually form a search engine without the intervention of central servers or a central actor.
with Zip2Map, one can find the geo map of any zip code(now U.S. only). finding the zip code, returns the Map of the location with its name and state name. Google Maps api has been used with PHP-MySql and lots of Ajax to make it a real WEB 2.0 Application
Project consist of 2 parts. One of them is a J2ME app. used to get information such as photo, position, speed & course from GPS and transfers it to the web server. Another one is a web app. which allows to manage and display received data using GoogleMap
JAMP provides several functions to index and manage your media files on resources like storage systems or dvds. The userinterface is webbased and fully written in java.
Google Sitemaps Toolbox (GSToolbox) is a toolbox designed for webmaster to generate, manage and view Google sitemaps files. It is composed of Google Sitemaps Stylesheet (GSStylesheet) and Google Sitemaps Director (GSDirector).
Dr. Micheal Kay: "Saxon 8.7 is the first release to be released simultaneously by Saxonica on the Java and .NET platforms." MDP: Mission accomplished! Saxon for the .NET platform from Saxonica is now available and supported via the http://saxon.sf.net
POPsearch is a desktop search engine that's designed to help you find
information on your computer. This information can then be accessed remotely with RSS feeds, email feeds, or from any computer that has a web browser.