From: sheby k Z. <sh...@re...> - 2005-04-22 07:34:06
|
=A0=0Ai would like to get a program in python that says that the 2 text fi= les are nearly duplicate.=0A=0AMeans:=0Asome text files are having same dat= a but not exactly duplicate. part of it is same content. i want to know tha= t the second file is 10% duplicate, 30% duplicate and so on to the first fi= le.=0A=0Acan we use SequenceMatcher in difflib.py?=0A=0Apls reply me if u h= ave any idea or how to do it or any program? =0A=0A=0A |