From: Dom L. <ci...@ho...> - 2001-09-10 13:31:42
|
The information that wvSummary uses comes from the MSWord DOC summary streams - i.e. MSWord wrote in its file format that there were N characters and X words in the document. We're just outputting what MSWord saved, basically. If you want to get a better character and word count, use the wvText utility and the 'wc' program. It'll do an ok job in most situations. Because MSWord is a dynamic reflow engine, we'll never get the line and page counts 100% all of the time no matter what we do. Hope that this helps, Dom >From: "Ajit Sadasivan" <aj...@sp...> >Reply-To: "Ajit Sadasivan" <aj...@sp...> >To: <wvw...@li...> >Subject: [Wvware-users] Doc Character Count Using WV >Date: Fri, 7 Sep 2001 20:28:25 -0400 > >Hi, > > I am trying to use the wv utility to >count the number of characters in a word document.We use >the DocCount as our reference. > > What we are doing is parsing the text >file generated by wv.In addition to the ASCII characters, >we need to take into account the following while doing the count. >(1)Blank Lines >(2)Spaces >(3)Tabs >(4)Hard Returns >(5)Headers and footers every appearance > > But the text file created by wv has some inconsistencies >in how the above is handled.For example it adds a Line feed and >three spaces to each line.How do I disable that ? >WV also adds a line feed and around 26 spaces for each tab in the >original word document.Even after removing that the word count >doesn't tally.So we would like to know what are all the other >changes which WV makes in the generated txt file. > >In there any way we could disable that in wv itself ? > >With Regards >Ajit.S > _________________________________________________________________ Get your FREE download of MSN Explorer at http://explorer.msn.com/intl.asp |