This project is designed to optimize search engine results by managing your web server sitemaps. The software combines both command line processes and a web user interface with a highly configurable architecture.
Ruya is a Python-based, breadth-first, level-based, delayed, event-based crawler for English and Japanese websites. It is targeted solely at developers who want crawling functionality and crawl control in their projects through an API.
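The crawl strategy named above (breadth-first, level-limited, delayed, with event callbacks) can be sketched in a few lines of Python. This is only an illustration of the general technique, not Ruya's actual API: the function `crawl_breadth_first`, its parameters, and the in-memory `SITE` link graph are all hypothetical names introduced here.

```python
import time
from collections import deque
from typing import Callable, Dict, Iterable, List

# Hypothetical in-memory link graph standing in for real pages (assumption).
SITE: Dict[str, List[str]] = {
    "a": ["b", "c"],
    "b": ["d"],
    "c": ["d", "a"],
    "d": ["e"],
}


def crawl_breadth_first(
    start_url: str,
    get_links: Callable[[str], Iterable[str]],
    max_level: int = 2,
    delay_seconds: float = 0.0,
    on_visit: Callable[[str, int], None] = lambda url, level: None,
) -> List[str]:
    """Visit pages level by level, never going deeper than max_level."""
    visited = {start_url}
    order: List[str] = []
    queue = deque([(start_url, 0)])
    while queue:
        url, level = queue.popleft()
        on_visit(url, level)  # event hook: the caller decides what to do
        order.append(url)
        if level >= max_level:
            continue  # do not expand links beyond the level limit
        for link in get_links(url):
            if link not in visited:
                visited.add(link)
                queue.append((link, level + 1))
        if delay_seconds > 0:
            time.sleep(delay_seconds)  # politeness delay between expansions
    return order


if __name__ == "__main__":
    pages = crawl_breadth_first("a", lambda u: SITE.get(u, []), max_level=2)
    print(pages)
```

Passing the link extractor in as `get_links` keeps the traversal logic separate from fetching and parsing, so the same loop works against a real HTTP client or a test fixture.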
NOCm stands for `NSUers Online Community Monitor`. Its task is to monitor NOC (`NSUers Online Community`) continuously and notify you as soon as there is a new post in NOC.
OpenWebsuite.org contains the powerful HTML editor (X)HTML-Format and the site analysis tool WebAnalyse. It includes project management, a CSS editor, a link checker (for whole domains), and much more. It was released under the GNU GPL to find helpers.
Bookmark-Manager is an advanced bookmark management utility for Windows supporting importing/exporting and merging of Internet Explorer favorites, Opera hotlists, Mozilla, Netscape, and Firefox bookmarks, XBEL, and HTML lists.
Aracnis is a Java-based framework for building distributed web spiders. These spiders can be used to accomplish a variety of tasks, for example, screen-scraping and link integrity checking.
Auto Proxy Filter Test (APFT) automates the testing of safe and unsafe URLs against a content filtering proxy (such as Dansguardian) and helps prevent regressions. APFT is useful to people who are designing filter rules.
Brain Net. A Distributed Search Engine Network. Aims to be a scalable distributed search engine allowing dynamic connection of nodes. Performance depends on bandwidth.
Yngvi allows Apache Webmasters to determine filesystem locations that are accessible through the server -- including those which have not been explicitly published. It's good for finding inadvertent exposures or unintended configuration inheritances.
rss2imap is a tool which gets RSS (RDF Site Summary) feeds from the web and delivers them to an IMAP server as e-mail messages. It enables you to use any IMAP-capable MUA as an RSS reader, and to unify mail checking and site update checking in ONE client.
JLinkCheck is an Ant task written in Java for checking links in websites. It does not just check one single page, but crawls a whole site like a spider, generating a report in XML and (X)HTML. JReptator will be its successor, with many more features.
Toke is a web mining toolkit for Java for web exploring, indexing, and searching. Toke allows you to crawl public or private web sites in order to create web statistics, web Pajek graphs, Lucene indexes, and word frequency files for data clustering.
Sperowider Website Archiving Suite is a set of Java applications, the primary purpose of which is to spider dynamic websites, and to create static distributable archives with a full text search index usable by an associated Java applet.
This project provides a system tray application that monitors the status of a project which uses a DART dashboard. Status is displayed by color-coded icons, and message dialogs alert the user when the build status changes.
InSite is a Web site management tool written in Perl. It checks link integrity and does some basic content monitoring of your site's files directly on the local disk, which gives it a huge speed advantage over similar tools.
ASpider is a robust, featureful, multi-threaded CLI web spider written in Java using Apache Commons HttpClient v3.0. It downloads any files matching your given MIME types from a website. By default it also tries to match e-mail addresses with a regular expression, logging all results using log4j.
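The two filters described for ASpider (accept files by MIME type, match e-mail addresses by regular expression) can be illustrated with a small Python sketch. It is not ASpider's code, which is written in Java: the names `wants_mime_type`, `find_emails`, and `EMAIL_PATTERN` are hypothetical, and the pattern is a deliberately simple approximation of an e-mail address, not a full RFC-compliant matcher.

```python
import re
from typing import Iterable, List

# Simplified e-mail pattern (assumption): good enough for a sketch only.
EMAIL_PATTERN = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def wants_mime_type(content_type: str, accepted: Iterable[str]) -> bool:
    """Return True if a Content-Type header value names an accepted MIME type."""
    # Drop parameters such as "; charset=utf-8" and compare case-insensitively.
    mime = content_type.split(";", 1)[0].strip().lower()
    return mime in {a.lower() for a in accepted}


def find_emails(text: str) -> List[str]:
    """Return e-mail-like strings in order of first appearance, without repeats."""
    found: List[str] = []
    for match in EMAIL_PATTERN.findall(text):
        if match not in found:
            found.append(match)
    return found


if __name__ == "__main__":
    print(wants_mime_type("Application/PDF; charset=binary", ["application/pdf"]))
    print(find_emails("Contact bob@example.com or alice@example.org."))
```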