HTML Tidy

Tracker: Bugs

Using tidy.exe from Homesite leaves <o:p> tags - ID: 1067112
Last Update: Settings changed ( arnaud02 )

As reported in bug 634889 (closed), but this is
occurring with the latest build of the windows binary
(6 October 04).

Numerous empty <o:p></o:p> tags are left after calling
tidy.exe from within Homesite 5.5, with 'clean up
Word2000' set.

See attached Word "HTML" source.

David Nicholls


Submitted: David Nicholls ( dcnicholls ) - 2004-11-16 03:06
Priority: 5
Status: Closed
Resolution: Fixed
Assigned: Nobody/Anonymous
Category: HTML/XHTML Parser
Group: Current - all platforms
Visibility: Public


Comments ( 20 )

Date: 2005-10-05 13:15
Sender: arnaud02 (Project Admin)

Logged In: YES
user_id=566665

Let's put this issue to rest. Please open a new bug if it is
not fixed.
We have no control over http://dev.int64.org/tidy.html, so we
cannot help.



Date: 2005-10-05 13:00
Sender: dcnicholls

Logged In: YES
user_id=647087

OK, thanks for the heads up. I have no way to compile (at
least none I have time to attempt), so I'll have to see if I
can find anyone with the appropriate capability. But I
think it's unlikely, so I'll probably have to wait until the
next Windows exe is available on the binaries page. DN


Date: 2005-10-05 12:51
Sender: arnaud02 (Project Admin)

Logged In: YES
user_id=566665

As the fix was added in August, you need to get a more
recent version. tidy.sf.net does not produce Windows
executables. Therefore, you will have to build it yourself or
find somebody to do so.


Date: 2005-10-05 12:47
Sender: dcnicholls

Logged In: YES
user_id=647087

OK, I managed to find the May 05 Windows .exe (I'm reminded
of Douglas Adams' Hitchhiker's Guide "beware of the leopard"
- not easy to find stuff unless you already know where to
look) but I'm not able to test the new version until next
week as I don't have access to the latest Word until then.
I'll advise on the outcome. DN


Date: 2005-10-05 12:45
Sender: hoehrmann (Project Admin, Accepting Donations)

Logged In: YES
user_id=188003

http://tidy.sourceforge.net/ links to
http://dev.int64.org/tidy.html for Windows binaries. Where
did you get the builds in your 2004-11-17 comment from if a
version from 2001 is the latest you have?


Date: 2005-10-05 12:32
Sender: dcnicholls

Logged In: YES
user_id=647087

The latest compiled Windows executable is dated 2001 (which
I already have). I don't have any way to compile a Windows
executable, if there's anything more recent than 2001. The
2001 version *does* suffer from the o:p problem. DN


Date: 2005-10-05 11:57
Sender: arnaud02 (Project Admin)

Logged In: YES
user_id=566665

Set to pending again. Post an update or open a new bug if a
problem arises with the latest version of tidy.



Date: 2005-10-04 10:49
Sender: dcnicholls

Logged In: YES
user_id=647087

The original attachment shows the problem, but I missed your
message dated 2005-8-4 saying it was fixed in CVS. (the
layout of this system lends itself to missing stuff). Does
this mean it's fixed in the version available on
Sourceforge? If so, I'll test it again tomorrow (5 Oct). DN


Date: 2005-10-04 08:17
Sender: arnaud02 (Project Admin)

Logged In: YES
user_id=566665

Please provide a small example and tell us which version of
tidy you are using, along with which options.



Date: 2005-10-04 02:46
Sender: kerri9494

Logged In: YES
user_id=1322758

Closed again?

Anyway, problem still seems to exist...I see it on Mac OS, even with
Word2K for Mac.


Date: 2005-10-04 02:20
Sender: sf-robot (SourceForge.net Site Admin)

Logged In: YES
user_id=1312539

This Tracker item was closed automatically by the system. It was
previously set to a Pending status, and the original submitter
did not respond within 30 days (the time period specified by
the administrator of this Tracker).


Date: 2005-09-03 03:06
Sender: dcnicholls

Logged In: YES
user_id=647087

Not clear why the robot should kill stuff. The problem
certainly isn't solved. It occurs on Windows 2K and XP,
with the latest MS Word, not Word 2000. The problem doesn't
arise in Word2K. DN


Date: 2005-09-03 02:20
Sender: sf-robot (SourceForge.net Site Admin)

Logged In: YES
user_id=1312539

This Tracker item was closed automatically by the system. It was
previously set to a Pending status, and the original submitter
did not respond within 30 days (the time period specified by
the administrator of this Tracker).


Date: 2005-08-03 18:14
Sender: arnaud02 (Project Admin)

Logged In: YES
user_id=566665

Fixed in CVS.


Date: 2005-08-02 17:14
Sender: kerri9494

Logged In: YES
user_id=1322758

The current version of BBEdit (8.2.2) also leaves the <o:p> elements in
place, so I wonder if this really could be a Windows-specific problem. It
is especially problematic because the <o:p> element is often the only
thing in a <p> element, but since the <p> is non-empty, it doesn't trim it
even if you've chosen to trim empty <p> elements.
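
A minimal sketch of the ordering issue described in this comment, written in Python rather than with Tidy itself. It does not reproduce Tidy's internals or the fix later committed to CVS; the function name `strip_word_residue`, the two regular expressions, and the sample markup are all hypothetical illustrations. The idea is only that a <p> whose sole child is an empty <o:p> is not "empty" until the <o:p> has been removed, so the <o:p> pass has to run first.

```python
import re

# Hypothetical patterns for illustration only (not taken from Tidy's source).
# An <o:p> element containing nothing but whitespace or &nbsp; entities.
EMPTY_OP = re.compile(r"<o:p>(?:\s|&nbsp;)*</o:p>", re.IGNORECASE)
# A <p> element (attributes allowed) containing nothing but whitespace.
EMPTY_P = re.compile(r"<p\b[^>]*>\s*</p>", re.IGNORECASE)


def strip_word_residue(html: str) -> str:
    """Remove empty <o:p> elements first, then any <p> left empty by that."""
    without_op = EMPTY_OP.sub("", html)
    return EMPTY_P.sub("", without_op)


# Made-up sample resembling Word "HTML" output.
sample = '<p class="MsoNormal"><o:p></o:p></p><p>Text<o:p></o:p></p>'

# Trimming empty <p> elements alone changes nothing, because each <p>
# still contains an <o:p> child.
print(EMPTY_P.sub("", sample) == sample)

# Removing <o:p> first lets the now-empty paragraph be trimmed as well.
print(strip_word_residue(sample))
```

Regular expressions are a blunt tool for HTML and will miss nested or attribute-bearing <o:p> variants; the sketch is only meant to show why the order of the two passes matters.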


Date: 2005-04-09 20:53
Sender: nobody

Logged In: NO

fall back on all unknown ip address that use roving multipliers


Date: 2005-02-23 14:20
Sender: hoehrmann (Project Admin, Accepting Donations)

Logged In: YES
user_id=188003

I don't think anyone is working on the word2000 cleanup, we
should consider moving this to the feature requests...


Date: 2004-11-29 04:11
Sender: terry_teague

Logged In: YES
user_id=225318

See also :

<https://sourceforge.net/tracker/index.php?func=detail&aid=1049346&group_id=27659&atid=390964>


Date: 2004-11-17 02:50
Sender: dcnicholls

Logged In: YES
user_id=647087

Further work (i.e. cleaning the same file from within Homesite
with different Tidy versions) suggests this problem was
introduced in the Windows build somewhere between the
versions distributed with Homesite (created 16 Jan 01) or
Topstyle 3 (4 May 02) and recent Windows binary tidy.exe
builds (17 Mar 04, 5 Jul 04 and 6 Oct 04).
The <o:p> tag is removed by the old builds but not by the
newer ones.
It may also be Windows-specific, as the online version of
Tidy at http://infohound.net/tidy/ (on Linux) does not have
the problem. I've also had success with TidyUI but not with
HTMLTrim, with (as close as possible) the same settings.
DN


Date: 2004-11-16 04:04
Sender: dcnicholls

Logged In: YES
user_id=647087

The same thing occurs using HTMLTrim (19 Aug 04). DN


Attached File ( 1 )

Filename Description Download
test2ka.htm Word 2K output "HTML" Download

Changes ( 26 )

Field              Old Value                 Date              By
close_date         -                         2005-10-05 13:15  arnaud02
status_id          Open                      2005-10-05 13:15  arnaud02
close_date         2005-10-05 12:51          2005-10-05 13:00  dcnicholls
status_id          Pending                   2005-10-05 13:00  dcnicholls
status_id          Open                      2005-10-05 12:51  arnaud02
close_date         -                         2005-10-05 12:51  arnaud02
close_date         2005-10-05 11:57          2005-10-05 12:32  dcnicholls
status_id          Pending                   2005-10-05 12:32  dcnicholls
status_id          Open                      2005-10-05 11:57  arnaud02
close_date         -                         2005-10-05 11:57  arnaud02
close_date         2005-10-04 08:17          2005-10-04 10:49  dcnicholls
status_id          Pending                   2005-10-04 10:49  dcnicholls
close_date         2005-10-04 02:20          2005-10-04 08:17  arnaud02
status_id          Closed                    2005-10-04 08:17  arnaud02
close_date         2005-09-03 03:06          2005-10-04 02:20  sf-robot
status_id          Pending                   2005-10-04 02:20  sf-robot
status_id          Closed                    2005-09-03 03:06  dcnicholls
close_date         2005-09-03 02:20          2005-09-03 03:06  dcnicholls
status_id          Pending                   2005-09-03 02:20  sf-robot
close_date         2005-08-03 18:14          2005-09-03 02:20  sf-robot
resolution_id      None                      2005-08-03 18:14  arnaud02
status_id          Open                      2005-08-03 18:14  arnaud02
close_date         -                         2005-08-03 18:14  arnaud02
artifact_group_id  Current - Win32 specific  2004-11-29 04:11  terry_teague
category_id        Other                     2004-11-29 04:11  terry_teague
File Added         108929: test2ka.htm       2004-11-16 03:06  dcnicholls