An object relational-mapping (ORM) library for Java
Hibernate is an Object/Relational Mapper tool. It's very popular among Java applications and implements the Java Persistence API. Hibernate ORM enables developers to more easily write applications whose data outlives the application process. As an Object/Relational Mapping (ORM) framework, Hibernate is concerned with data persistence as it applies to relational databases (via JDBC).
Google Search, Google Site Search, Google News from the terminal
googler is a power tool to Google (Web & News) and Google Site Search from the command-line. It shows the title, URL and abstract for each result, which can be directly opened in a browser from the terminal. Results are fetched in pages (with page navigation). Supports sequential searches in a single googler instance. googler was initially written to cater to headless servers without X. You can integrate it with a text-based browser. However, it has grown into a very handy and flexible utility that delivers much more. For example, fetch any number of results or start anywhere, limit search by any duration, define aliases to google search any number of websites, switch domains easily... all of this in a very clean interface without ads or stray URLs. The shell completion scripts make sure you don't need to remember any options. googler isn't affiliated to Google in any way. Demo: https://asciinema.org/a/85019
Forked from https://sf.net/p/fmd/
The Free Manga Downloader (FMD) is an open source application written in Object-Pascal for managing and downloading manga from various websites. This is a mirror of main repository on GitHub. For feedback/bug report visit https://github.com/riderkick/FMD
Virtuoso is a scalable cross-platform server that combines Relational, Graph, and Document Data Management with Web Application Server and Web Services Platform functionality.
Search the web for video, music, books, software, games, subtitles and
FileMasta is intended to provide a very powerful web server indexing and search service allowing you to find a file among millions of files located on public servers around the internet. The search engine is powered by a database that holds information about all the files web servers have. The information about the files is gathered by an intelligent web crawler that runs every 2 to 4 days. It keeps the database clean and up-to-date with the previous contents and new entries for each web server address submitted by users. All web addresses this app uses for indexing are submitted by users only, our crawler also try to give more priority to the servers that have the most up-time, if a web server isn't checked successfully in a period of time, usually 10-15 days, it is automatically deleted from our database. A list of the servers we index are stored in our open directories.
Andy's PHP Knowledgebase using MySQL is a database driven Web Application for storing, searching and updating article content for a knowledgebase. Andy's PHP Knowledgebase is easily customized and has potential for a variety of creative uses.
A search application to watch and download movies and TV shows
A federated search desktop application to read about, preview, watch, and download any movie and television titles that are being shared online.
Interleave is a business process management application. It enables you to model your business process and make it available online. It's meant to replace processes which currently rely on paper or spreadsheets and it has a good workflow engine.
WACS is a tool for building Adult Web Sites; equally suitable for managing a private collection or building a commercial site. It has many powerful features including dynamic filtering, model catalogs, automatic download and powerful search engine.
An open source search engine with RESTFul API and crawlers
OpenSearchServer is a powerful, enterprise-class, search engine program. Using the web user interface, the crawlers (web, file, database, etc.) and the client libraries (REST/API , Ruby, Rails, Node.js, PHP, Perl) you will be able to integrate quickly and easily advanced full-text search capabilities in your application: Full-text with basic semantic, join queries, boolean queries, facet and filter, document (PDF, Office, etc.) indexation, web scrapping,etc. OpenSearchServer runs on Windows and Linux/Unix/BSD.
Archive your personal history
ResCarta Toolkit offers an open source solution to creating, storing, viewing, and searching digital collections. Applications in the toolkit let users create and edit metadata, convert data to open standard ResCarta format, index and host collections.
Desktop application for proxy searching
You want to find free proxy, but it is so hard to do it manually? Just try Burd's Proxy Searcher program. It is looking for list of proxies in Internet with using of public search engines, checks if those proxies works in your Internet segment, gathers additional information. If you want to be anonymous and don't want to spend much time for manual search then this program was developed especially for you. Tags: the free proxy, proxy list, proxies, proxy for free, proxy providers
panFMP is a generic framework suitable for harvested XML metadata that is searchable through Apache Lucene without any additional RDBMS. Fields can be defined by XPath allowing for full text queries on all types of fields including numerical ranges.
Easy web based address book and contact manager
Address Book Express is a super simple, intuitive & responsive web-based address book and contact manager written in PHP with MySQL backend featuring groups, addresses, e-Mails, phone numbers, contact photo, birthdays with admin panel and many more.
Scan country IP ranges in CIDR format or scan links
Scan country IP ranges in CIDR format or scan links. Links will be saved in MySQL database.
cpDetector is a proxy for codepage detection of documents. It delegates to multiple instances that try to detect the codepage by different techinques. A command line executeable is shipped that allows to sort documents by codepage.
DuckDuckGo from the terminal
ddgr is a cmdline utility to search DuckDuckGo from the terminal. While googler is highly popular among cmdline users, in many forums the need of a similar utility for privacy-aware DuckDuckGo came up. DuckDuckGo Bangs are super-cool too! So here's ddgr for you! Unlike the web interface, you can specify the number of search results you would like to see per page. It's more convenient than skimming through 30-odd search results per page. The default interface is carefully designed to use minimum space without sacrificing readability. ddgr isn't affiliated to DuckDuckGo in any way. Demo: https://asciinema.org/a/151849
Zoozle 2008 - 2010 Webpage, Tools and SQL Files
Download search engine and directory with Rapidshare and Torrent - zoozle Download Suchmaschine All The files that run the World Leading German Download Search Engine in 2010 with 500 000 unique visitors a day - all the tools you need to set up a clone. Source Code used and enhanced by: https://www.artikelschreiber.com/en/ Code Contains: - PHP Files for zoozleNET, zoozleORG - Perl Crawler for gathering new content to database and all other cool tools i have created (c) Sebastian Enger 2005-2014
Google Dublicate Content Problem Solver - Open Source SEO & SEM Tools
Programm to solve the google dublicate content problem - change texts to slip through googles PANDA 4.0 Update. Open Source Tool for Search Engine Optimization (SEO & SEM) used for automatic content processing. Code based on IOSEC ADDONS: https://www.artikelschreiber.com/en/ https://www.artikelschreiber.com/ Code Contains: - Perl Source code, language databases and more
Easy Spider is a distributed Perl Web Crawler Project from 2006
Easy Spider is a distributed Perl Web Crawler Project from 2006. It features code from crawling webpages, distributing it to a server and generating xml files from it. The client site can be any computer (Windows or Linux) and the Server stores all data. Websites that use EasySpider and Perl/PHP Backends: https://www.artikelschreiber.com/en/ Webcrawlers are mostly the first thing to start programming at if you start your programming career. It is fun to look at some code that is few years ago and to see how one has improved himself. (c) Sebastian Enger 2005-2015
Search engine and data mining applications and ClueWeb datasets.
The Lemur Project develops search engines, browser toolbars, text analysis tools, and data resources that support research and development of information retrieval and text mining software, including the Indri search engine in C++, the Galago search engine research framework in Java, the RankLib learning to rank library, ClueWeb09 and ClueWeb12 datasets and the Sifaka data mining application.
A general purpose source code indexer and cross-referencer that provides web-based browsing of source code with links to the definition and usage of any identifier. Supports multiple languages. Up-to-date information in http://lxr.sourceforge.net
JuniCoder is a Java project that uses unicode as a base for decoding and encoding formats that invented workarounds to express characters not covered by ASCII. Decoders translate those inventions to unicode. Encoders encode to these inventions.
A collection of Dokuwiki plugins that will enable the user to spatially enable and use the wiki, currently we have: openlayersmap (a map), geotag (ways of geotagging a page)
List al URLs present in requested URL in absolute format
This php program extracts all URLs present on the requested URL, in absolute path.