CLucene - a C++ search engine / Bugs / #221 Memory leak in Analyzer::reusableTokenStream() call

#221 Memory leak in Analyzer::reusableTokenStream() call

Status: open

Owner: nobody

Labels: core (32)

Priority: 9

Updated: 2013-04-04

Created: 2013-04-04

Creator: Xiaoman Dong

Private: No

In file src\core\CLucene\index\DocumentsWriterThreadState.cpp, function void DocumentsWriter::ThreadState::FieldData::invertField()
around line 892, the stream = analyzer->reusableTokenStream(fieldInfo->name, reader); call is supposed to create a stream "reusable", but most of the analyzers are just creating a new stream.

For now I am not sure how to implement the reusable Token stream correctly (maybe should read latest Lucene code), but in my local build I just delete the stream and the memory leak is gone.

Discussion

Xiaoman Dong - 2013-04-04

priority: 5 --> 9
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Xiaoman Dong - 2013-04-04

I would like to help with this issue and borrow ideas from Lucene latest development.

The multi-thread support is a good improvement for my project. The bottleneck are actually string inverting and using more threads will reduce time cost.

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Memory leak in Analyzer::reusableTokenStream() call

Group

Searches

Help

#221 Memory leak in Analyzer::reusableTokenStream() call

Discussion