Build lead generation pipelines that pull emails, phone numbers, and company details from directories, maps, social platforms. Full API access.
Generate leads at scale without building or maintaining scrapers. Use 10,000+ ready-made tools that handle authentication, pagination, and anti-bot protection. Pull data from business directories, social profiles, and public sources, then export to your CRM or database via API. Schedule recurring extractions, enrich existing datasets, and integrate with your workflows.
Explore Apify Store
Financial reporting cloud-based software.
For companies looking to automate their consolidation and financial statement function
The software is cloud based and automates complexities around consolidating and reporting for groups with multiple year ends, currencies and ERP systems with a slice and dice approach to reporting. While retaining the structure, control and validation needed in a financial reporting tool, we’ve managed to keep things flexible.
JLinkCheck is an Ant Task written in Java for checking links in websites. It is not just checking one single page, but crawling a whole site like a spider, generating a report in XML and (X)HTML. JReptator will be its succesor with many more features
Pocomos is a cloud-based field service solution that caters to businesses
Built for the pest control industry, but also works great for Mosquito Control, Bin Cleaning, Window Washing, Solar Panel Cleaning, and other Home Service Businesses in need of an easy-to-use software that helps you simplify routing, scheduling, communications, payment processing, truck tracking, time tracking, and reporting.
The project Navigator aims at supporting automated gathering of dynamic information from third party web sites, using their web interface to post queries and to gather replies. Navigator is written in OS-independent java language.
Catalogo is a system for cataloguing resources on a web site. It allows semantic search of information on an intranet using metadata, RDF and ontology concepts. It provides a Catalog server (Java web applications) and a Catalog client (Firefox plug-in).
Roosster.org is a personal "on-demand" search engine. This means, it indexes only items/entries/files/URLs you explicitly tell it to index and provides a full-text-search over indexed items.
Goto http://roosster.org/dev for all details.
This project aims to create a free and open catalog over music that is popular to day including links to audio files and websites, created with our search engine, as well as statistics over genres and artist popularity.
Relased under GNU/GPL.
The Most Powerful Software Platform for EHSQ and ESG Management
Addresses the needs of small businesses and large global organizations with thousands of users in multiple locations.
Choose from a complete set of software solutions across EHSQ that address all aspects of top performing Environmental, Health and Safety, and Quality management programs.
SmartCrawler is a java-based fully configurable, multi-threaded and extensible crawler, which is able to fetch and analyze the contents of a web site by using dinamically pluggable filters
iCalGrabber is a java based application to grab event information from web sites. The events are stored on the filesystem based on Apples iCal format. These .ics files could be read by iCal specific applications like the Mozilla calendar.
Sperowider Website Archiving Suite is a set of Java applications, the primary purpose of which is to spider dynamic websites, and to create static distributable archives with a full text search index usable by an associated Java applet.
myDbSearcher is a search engine for MySQL Databases. It is written in Java. It scans several tables on different databases. A XMLRPC-Server will give you access to the Index.
Currently it runs on http://www.idowa.de/ueberblick/suche/index_html
Robust featureful multi-threaded CLI web spider using apache commons httpclient v3.0 written in java. ASpider downloads any files matching your given mime-types from a website. Tries to reg.exp. match emails by default, logging all results using log4j.
HouseSpider is a Java applet that adds search capability to your web site. It can search by two methods, by spidering through your site or by searching a cached index file. It has 100% i18n (internationalization) support.
Dr. Micheal Kay: "Saxon 8.7 is the first release to be released simultaneously by Saxonica on the Java and .NET platforms." MDP: Mission accomplished! Saxon for the .NET platform from Saxonica is now available and supported via the http://saxon.sf.net
Online news and newspaper harvester - Like RSS Newsreader w/ database. National & International News. Very detailed catches hard to find news articles. Allows resposting of summaries w/ comments to Usenet Newsgroups, complex searches & more.
Gumshoe Desktop Search indexes local files of various formats on a Windows desktop and provides a search GUI. The project is developed in Java. It builds on other opensource projects including Lucene, Luke, SWT, tagsoup, Jakarta POI and others.
COMMON VULNERABILITIES AND EXPOSURES (CVEŽ)
DATABASE BROWSER, CVEBROWSER
A web search engine for the CVE dictionary, targeted to be used on a intranet.
CVEBrowser uses Java Servlets / JSP and MySQL and its designed to work well on RedHat
MaxFisher is a powerful and convenient search engine for personal
computers and intranets. MaxFisher can be thought of as a cross platform, secure and private replacement for Google Desktop.
DialogSearch is an experimental approach to web site searching, which uses the similarity between web pages to retrieve them. It is an alternative to hyperlink-based algorithms such as PageRank and HITS. BEWARE: This is only an experimental prototype.
i-Tor is a set of Tools and Technologies for Open Repositories, based on Linux, Java, MySQL, Mirage and other free components. It harvests OAI and turns databases into Open Archives. It includes similarity, backlinks and related search based on Lucene.