From memory (my memory, not the computer), it's reporting matches between the unique kmers (found by meryl) in each of the two assemblies. It might be chaining contiguous kmers into larger matches. These matches form the base of the one-to-one output.
It builds a collision free hash table for one of the sequences, then scans the other sequence, reporting any hits as they occur.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
From memory (my memory, not the computer), it's reporting matches between the unique kmers (found by meryl) in each of the two assemblies. It might be chaining contiguous kmers into larger matches. These matches form the base of the one-to-one output.
It builds a collision free hash table for one of the sequences, then scans the other sequence, reporting any hits as they occur.
Uh?