Efficient indexing and searching (1M Weibo 500M data is indexed in 28 seconds, search response time is 1.65 milliseconds, and search QPS is 19K). Support Chinese word segmentation (concurrent word segmentation using the sego word segmentation package, speed 27MB/sec). Support to calculate the proximity distance of keywords in the text (token proximity). When a request to add a document to the index comes in, the main coroutine will send the text to be segmented to a word segmentation coroutine through a channel, and the coroutine will segment the text and send it to a word segmentation through another channel. Indexer coroutines. The indexer coroutine builds an inverted index from the search keyword to the document, and the inverted index table is stored in memory for quick recall. The main coroutine receives the user's request, segments the request phrase in the main coroutine, and then sends it to the indexer through the channel.

Features

  • Support calculation of BM25 correlation
  • Support for custom scoring fields and scoring rules
  • Support online addition and deletion of indexes
  • Support persistent storage
  • Enables distributed indexing and searching
  • Released under the commercial-friendly Apache License

Project Samples

Project Activity

See All Activity >

Categories

Text Editors, Search

License

Apache License V2.0

Follow Wukong

Wukong Web Site

Other Useful Business Software
MongoDB Atlas runs apps anywhere Icon
MongoDB Atlas runs apps anywhere

Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
Start Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Wukong!

Additional Project Details

Programming Language

Go

Related Categories

Go Text Editors, Go Search Software

Registered

2022-02-10