From: John Graham-C. <jgr...@us...> - 2005-03-07 22:37:26
|
Update of /cvsroot/popfile/engine/Classifier In directory sc8-pr-cvs1.sourceforge.net:/tmp/cvs-serv11112 Modified Files: MailParse.pm Log Message: Fix ugly comments Index: MailParse.pm =================================================================== RCS file: /cvsroot/popfile/engine/Classifier/MailParse.pm,v retrieving revision 1.217 retrieving revision 1.218 diff -C2 -d -r1.217 -r1.218 *** MailParse.pm 27 Dec 2004 16:51:26 -0000 1.217 --- MailParse.pm 7 Mar 2005 22:37:14 -0000 1.218 *************** *** 54,57 **** --- 54,58 ---- # Symbols in EUC-JP chars which cannot be considered a part of words + my $symbol_row1_euc_jp = '(?:[\xA1][\xA1-\xBB\xBD-\xFE])'; my $symbol_row2_euc_jp = '(?:[\xA2][\xA1-\xFE])'; *************** *** 59,71 **** my $symbol_euc_jp = "(?:$symbol_row1_euc_jp|$symbol_row2_euc_jp|$symbol_row8_euc_jp)"; [...1470 lines suppressed...] ! # mode(-w is passed to Kakasi as argument). Both input and ouput ! # encoding are EUC-JP. # # Since Text::Kakasi is not thread-safe, we use it under the # control of a semaphore to avoid a crash if we are running on ! # Windows in a forked process. # # Note that this requires us to detect a sub-process by looking at --- 2562,2572 ---- require Text::Kakasi; ! # Split Japanese line into words using Kakasi Wakachigaki mode(-w ! # is passed to Kakasi as argument). Both input and ouput encoding ! # are EUC-JP. # # Since Text::Kakasi is not thread-safe, we use it under the # control of a semaphore to avoid a crash if we are running on ! # Windows in a forked process. # # Note that this requires us to detect a sub-process by looking at |