OpenWebSpider is an Open Source multi-threaded Web Spider (robot, crawler) and search engine with a lot of interesting features!
The stuff here has no documentation and some of it may never be completed. This is my playground, use at your own risk.
The Rainbow project is an open source initiative to build a comprehensive content management system using Microsoft's ASP.NET and C# technologies. It has ASP.NET 1.1 and ASP.NET 2.0 code bases.
A drop-in framework for adding tagging (folksonomy) capabilities to existing applications
This is a project which has spawned off of a project that I worked on in the past. It is a web-crawler, which searches the sites that are crawled for search terms.
Port of the Google sitemap generator, from Python to Csharp aka C-Sharp aka C# aka .NET aka dotNet.
AJAX,document discovery search system for legal discovery,email search & other types of docs i.e. MSWord,Excel, TIFF, etc. Uses SQL Server FTS - full text search, to index emails & attachments. Documents are stored in SQL Server Written in C# ASP.NET
GSA.NET API is an API for the Google Search Appliance and Google Mini for the .NET platform.
Galilei is a Copernic clone for the GNOME desktop, a GUI internet meta-search application.
High performance distributed in-memory key/value store
Infinispan is an open source, Java based data grid platform. ***IMPORTANT*** Starting with Infinispan 5.0.0.FINAL, Infinispan releases are no longer hosted in Sourceforge. They can now be located in www.jboss.org/infinispan/downloads
Jukuu is a bilingual sentence search engine. According users' queries, It helps users to search similar sentences of another language. With high performance architecture, It can retrieve result sets from billion sentence pairs within 0.0001s.
OAI-SOAP will provide a test bed for the application of SOAP/Web service/UDDI forms of OAI protocol.
Webfear Structiatella is a C# written webspider or mirroring engine, which allowes to specify (even multilevel) content of websites and mirror it into a XML file. It provides a GUI, mirroring engine and a DLL-api for integration into own programs.
wordseg project is a word segment module implemented by C#
wordseg project is a word segment module implemented by C#. It is used to segment text into tokens and to label token's attribute according its context and semantic by front-maximum matching and CRF algorithms. The following are some sentences need to be segmented: 张晓晨和付仲恺一起坐在家（西坝河东里社区）里的沙发上看非诚勿扰。 百度公司的名字源于“众里寻他千百度”这诗句。 After above sentences be segmented by wordseg, the result as follows for each sentence: 张晓晨[PER] 和 付仲恺[PER] 一起 坐 在 家 （ 西坝河东里社区[LOC] ） 里 的 沙发[PDT] 上 看 非 诚 勿扰 。 百度公司[ORG] 的 名字 源于 “ 众 里 寻 他 千百度 ” 这 诗句 。 In above, if a token has some attributes, the attribute result will be appended into the corresponding token within "". Since wordseg has introduced statistics model to segment text by context, for same sub string in different context, dif