BigQuery automates data prep, analysis, and predictions with built-in AI assistance.
Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
Try Free
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.
MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
Last update on tue mar 26 16:25 UTC 2012
The Web Crawler Security is a python based tool to automatically crawl a web site. It is a web crawler oriented to help in penetration testing tasks. The main task of this tool is to search and list all the links (pages and files) in a web site.
The crawler has been completely rewritten in v1.0 bringing a lot of improvements: improved the data visualization, interactive option to download files, increased speed in crawling, exports list of...
ftpsearch is a web based indexing searchengine for the ftp server, which supports regular expressions in queries, new files monitoring, some fancy stats and so on.
Transform your applications and workflows into powerful agentic systems at global scale.
Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
Hatta is a wiki engine that uses a Mercurial repository for storing the pages. You can run it as a web application on your server, or locally on your computer. You can also do both, and synchronize the repositories once in a while.
Reporting engine library written in C. Create one XML file and generate PDF, HTML, TXT, and CSV reports based on queries. Has support for MySQL, PostgreSQL, ODBC. Bindings for PHP, Java, Python.
BTG Web SearchEngine is developed for students or developers who are interested in searchengine technology. The engine web will be written in C++, Java, Perl,and Python. At the beginning of the project, a simple searchengine design will be used.
Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.
Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
A HTML scraper that uses machine learning frameworks to extract labelled fields from raw HTML. The project also involves the development of a tool to display the semi structured data generated by the scraper component.
a small collection of python 3000 scripts/modules used to automate searching craigslist.org cities and categories for interesting stuff; these scripts currently use html screen scraping, since craigslist currently has no api
Wixi is a multi-platform wiki application for the desktop. It is written in python/wxpython and does anything you might expect from a wiki application. Generating html from text is done by txt2tags, wich is simple and easy to learn.
zSearch is a simple python based crawler and searchengine. Raw HTML are stored in bzip2 archives, the index is created using pylucene, and twsited is used to provide internal http server. Results are sent back as XML over HTTP.
TallStreet.com is a new searchengine concept where users make investments, with fictional money, in their favourite websites and the rankings are determined entirely by the users.
metamax_en is a quite simple but very usefull webtool to generate HTML-Meta-Tags. It can be used to improve the search-relevance of your own page. Also you can place it as a free tool in your download-area.
See: http://www.eudict.eu/metamax_en.html
htmltmpl is a templating engine for Python and PHP. It is targeted to web application developers, who want to separate program code and design (HTML code) of their projects. Even webdesigners can easily learn its simple but powerful template language.
Bookmark-Manager is an advanced bookmark management utility for Windows supporting importing/exporting and merging of Internet Explorer favorites, Opera hotlists, Mozilla, Netscape, and Firefox bookmarks, XBEL, and HTML lists.
Wiki engine written in Python, OS independent and using dbm files to store data. Has diff, content and link search, recent changes, file upload and user auth. engine (http://bane.3dnet.co.yu/bpp)
Each user can run their own threaded searchengine and contribute to a global search database searching only the sites they want. It is built using Turbogears.
Fast SMB Search is a searchengine for local SMB-based networks (e.g Windows networks). It's key feature is the ability to quickly search for a file in a large network. Also supports FTP search, so project name is not strict
A content generation engine written in Python used for generating content for HTML and textual output. Integrates with Apache to form a web framework that uses XML templates and can embed Python.
Cheshire3 is a fast Z39.50, SRW, XML searchengine, written in Python for extensability and using C libraries for speed. Next generation of the Cheshire system (http://cheshire.berkeley.edu) and designed around a distributable, object oriented model.