ARADO RSS Feed Reader is a URL database for web search and RSS feed reading. It saves the bookmarks and RSS feeds you add and syncs the newest URLs with your connected devices. Store and search all your URLs in ARADO. It is built on the C++ / Qt framework.
eZimDMS, based on DocMgr, is a Document Management System (DMS). It provides more commercial features, such as an upload progress bar, enhanced security, and more. Our goal is to provide a full and complete document management system.
Discontinued lightweight Desktop-Files/SMB/FTP crawler and search engine.
wordseg is a word segmentation module implemented in C#. It segments text into tokens and labels each token's attributes according to its context and semantics, using forward maximum matching and CRF algorithms. The following sentences are examples to be segmented: 张晓晨和付仲恺一起坐在家（西坝河东里社区）里的沙发上看非诚勿扰。 百度公司的名字源于“众里寻他千百度”这诗句。 After segmentation by wordseg, the result for each sentence is as follows: 张晓晨[PER] 和 付仲恺[PER] 一起 坐 在 家 （ 西坝河东里社区[LOC] ） 里 的 沙发[PDT] 上 看 非 诚 勿扰 。 百度公司[ORG] 的 名字 源于 “ 众 里 寻 他 千百度 ” 这 诗句 。 In the output above, if a token has attributes, they are appended to the corresponding token within "[]". Because wordseg uses a statistical model to segment text by context, for same sub string in different context, dif
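As a rough illustration of the forward maximum matching step named in the wordseg entry, here is a minimal Python sketch. It is not wordseg's actual C# code: the function name `forward_max_match` and the tiny `LEXICON` are hypothetical, and the CRF-based labeling that produces tags such as [PER] or [ORG] is not shown.

```python
def forward_max_match(text, lexicon, max_len=None):
    """Greedy left-to-right segmentation: at each position take the
    longest lexicon entry that matches, else fall back to one character."""
    if max_len is None:
        # Longest entry in the lexicon bounds the candidate window.
        max_len = max((len(word) for word in lexicon), default=1)
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, shrinking until there is a hit.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if size == 1 or candidate in lexicon:
                tokens.append(candidate)
                i += size
                break
    return tokens


# Hypothetical toy lexicon, for illustration only.
LEXICON = {"百度公司", "名字", "源于", "诗句"}

tokens = forward_max_match("百度公司的名字源于", LEXICON)
print(" ".join(tokens))
```

Characters that match no lexicon entry come out as single-character tokens, which is one reason a statistical model is typically layered on top of pure dictionary matching.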
Provides a web interface for browsing any predefined directory on UNIX and Windows based platforms. The package can easily be updated or embedded into other projects. It has also been adapted as a PostNuke plugin, among others.
FirteX is a high-performance, full-featured text indexing and retrieval platform. It provides a flexible and practical experiment platform for researchers, as well as a scalable platform for Web search development. It is very fast, and has good support for Chi
Jukuu is a bilingual sentence search engine. Given a user's query, it helps the user find similar sentences in another language. With its high-performance architecture, it can retrieve result sets from a billion-scale corpus of sentence pairs within 0.0001s.
OpenCLAS is an open source implementation of ICTCLAS (Institute of Computing Tech., Chinese Lexical Analysis), with three language branches: C++, Java and C#. The library can be used to segment Chinese sentences into words with part-of-speech (POS) tags.
SharpResource is a smart web resource retrieval engine for Internet data mining, in script-based or automatic mode, written in C#. It is component-driven and fully customizable. It aims to be a versatile and robust library, not a system.
Solo (Search Online Ores) is an Eclipse RCP based meta search engine. End users can configure it to search the sites they care about, such as online book stores, forums, news portals and general search engines like Google, and it returns structured results.
Uni-wordsplit aims to provide a Unicode lexical analysis / word splitter system, designed especially for CJK (Chinese/Japanese/Korean) users. The code is based on Mozilla XPCOM code.