OpenWebSpider is an Open Source multi-threaded Web Spider (robot, crawler) and search engine with a lot of interesting features!
The stuff here has no documentation and some of it may never be completed. This is my playground, use at your own risk.
High performance distributed in-memory key/value store
Infinispan is an open source, Java based data grid platform. ***IMPORTANT*** Starting with Infinispan 5.0.0.FINAL, Infinispan releases are no longer hosted in Sourceforge. They can now be located in www.jboss.org/infinispan/downloads
Dr. Micheal Kay: "Saxon 8.7 is the first release to be released simultaneously by Saxonica on the Java and .NET platforms." MDP: Mission accomplished! Saxon for the .NET platform from Saxonica is now available and supported via the http://saxon.sf.net
My Community Portal is a all in one internet portal that offers, forum, groups, chat, your own e-mail, search engine, internet directory, your own home page, poll's, dating services, buddy list, MP3 and file sharing, and many more.
A drop-in framework for adding tagging (folksonomy) capabilities to existing applications
The Anywhere Location Search allows for location searches using a wide range of inputs (address, city/state, zip code, search string, IP address, landmark name, etc).
This is a project which has spawned off of a project that I worked on in the past. It is a web-crawler, which searches the sites that are crawled for search terms.
Port of the Google sitemap generator, from Python to Csharp aka C-Sharp aka C# aka .NET aka dotNet.
AJAX,document discovery search system for legal discovery,email search & other types of docs i.e. MSWord,Excel, TIFF, etc. Uses SQL Server FTS - full text search, to index emails & attachments. Documents are stored in SQL Server Written in C# ASP.NET
GSA.NET API is an API for the Google Search Appliance and Google Mini for the .NET platform.
Tool to check positions in search engines for given keywords and rank them against competitors.
Galilei is a Copernic clone for the GNOME desktop, a GUI internet meta-search application.
Jaguar is highly loaded, distributed and easily scalable search engine. Main parts: NoSQL database server, Сommunication Queue server, multi-threaded Scanner and Analyzer services, Management tool.
Jukuu is a bilingual sentence search engine. According users' queries, It helps users to search similar sentences of another language. With high performance architecture, It can retrieve result sets from billion sentence pairs within 0.0001s.
OAI-SOAP will provide a test bed for the application of SOAP/Web service/UDDI forms of OAI protocol.
The Rainbow project is an open source initiative to build a comprehensive content management system using Microsoft's ASP.NET and C# technologies. It has ASP.NET 1.1 and ASP.NET 2.0 code bases.
A tool for web developers to easily add dynamic Sitemap and RSS feed Xml files for C# and ASP.Net.
WASOLIC is a Web-based Application to Search Online Library Catalogues. Main technologies are .NET, WCF, Silverlight / Mono, Moonlight.
Webfear Structiatella is a C# written webspider or mirroring engine, which allowes to specify (even multilevel) content of websites and mirror it into a XML file. It provides a GUI, mirroring engine and a DLL-api for integration into own programs.
wordseg project is a word segment module implemented by C#
wordseg project is a word segment module implemented by C#. It is used to segment text into tokens and to label token's attribute according its context and semantic by front-maximum matching and CRF algorithms. The following are some sentences need to be segmented: 张晓晨和付仲恺一起坐在家（西坝河东里社区）里的沙发上看非诚勿扰。 百度公司的名字源于“众里寻他千百度”这诗句。 After above sentences be segmented by wordseg, the result as follows for each sentence: 张晓晨[PER] 和 付仲恺[PER] 一起 坐 在 家 （ 西坝河东里社区[LOC] ） 里 的 沙发[PDT] 上 看 非 诚 勿扰 。 百度公司[ORG] 的 名字 源于 “ 众 里 寻 他 千百度 ” 这 诗句 。 In above, if a token has some attributes, the attribute result will be appended into the corresponding token within "". Since wordseg has introduced statistics model to segment text by context, for same sub string in different context, dif