Thread: [Pasdoc-main] Should we make 0.12.1 bugfix release? Does Delphi add (by default) UTF-8 BOM?

Documentation generator for Object Pascal code

Brought to you by: ccodere, johill, kambi

pasdoc-main

[Pasdoc-main] Should we make 0.12.1 bugfix release? Does Delphi add (by default) UTF-8 BOM?

From: Michalis K. <mic...@gm...> - 2010-11-03 09:45:44

Looks like a bug slipped into 0.12.0 release that prevents reading files
starting with UTF-8 BOM. It's fixed now of course in SVN.

I know that Lazarus and most text editors in general do not add UTF-8
BOM, so I think this isn't a problem for FPC/Lazarus users. Is it a
large problem for Delphi users? Does some new Delphi version
automatically add UTF-8 BOM? Bear in mind that I do not use/own Delphi
since a long time, so I'm asking you.

If you guys think so, we can make a 0.12.1 bugfix release, even today.

Arno implemented handling of some new Delphi features in the last days
(look at ChangeLog file), so we even have something new already :)

Michalis

Re: [Pasdoc-main] Should we make 0.12.1 bugfix release? Does Delphi add (by default) UTF-8 BOM?

From: Arno G. <arn...@gm...> - 2010-11-03 12:40:29

Michalis Kamburelis wrote:
> Looks like a bug slipped into 0.12.0 release that prevents reading
> files starting with UTF-8 BOM. It's fixed now of course in SVN.
> 
> I know that Lazarus and most text editors in general do not add UTF-8
> BOM, so I think this isn't a problem for FPC/Lazarus users. 

Using UTF-8 without BOM is eval IMO, since there is no way to detect 
a charset reliable. Any test can only verify that it is correctly 
encoded UTF-8, it might be a different charset nevertheless, and 
testing is expensive.

> large problem for Delphi users? 

I do not know how many users actually use UTF-8 source files? 

> Does some new Delphi version automatically add UTF-8 BOM? 

Yes, when it opened a UTF-8 source file without BOM (detected), it's
saved with a BOM silently (just tested in XE). In all other cases you 
have to convert a unit to UTF-8, UCS-2, UCS-2Be or UCS-4 and UCS-4Be 
explicitly in editor's context menu before Delphi saved it with a BOM,
the default is still ANSI. 

IMO PasDoc should detect all BOMs used by Delphi and raise an exception
if a charset is not supported.

> Bear in mind that I do not use/own Delphi
> since a long time, so I'm asking you.
> 
> If you guys think so, we can make a 0.12.1 bugfix release, even today.

That would be nice for the non-svn users.

-- 
Arno Garrels

Re: [Pasdoc-main] Should we make 0.12.1 bugfix release? Does Delphi add (by default) UTF-8 BOM?

From: Michalis K. <mic...@gm...> - 2010-11-03 13:29:51

Arno Garrels wrote:
> IMO PasDoc should detect all BOMs used by Delphi and raise an exception
> if a charset is not supported.

The SVN already detected UTF-8 and both UTF-16 BOMs. (In case of UTF-16,
this merely causes clear error messages when compiled without
STRING_UNICODE.)

And, I just committed a code to detect UTF-32 BOMs. So this is complete.
(You may want to make similar changes as I did in rev 1265 to the
TStreamReader.GetCodePageFromBOM).

> That would be nice for the non-svn users.

Ok, so let's release PasDoc 0.12.1. I'll do it... well, right now :)

(Only, first I'll make a promised "version marker" for cache files, to
improve serialization troubles mentioned in #3101524).

Michalis

Re: [Pasdoc-main] Should we make 0.12.1 bugfix release? Does Delphi add (by default) UTF-8 BOM?

From: Hans-Peter D. <DrD...@ao...> - 2010-11-03 13:09:54

Michalis Kamburelis schrieb:
> Looks like a bug slipped into 0.12.0 release that prevents reading files
> starting with UTF-8 BOM. It's fixed now of course in SVN.

I came across the same problem in the FPC scanner. The scanner skips the 
BOM, but then erroneously returns the first char of the BOM instead of 
the next char. Dunno what's the problem with PasDoc, but it seems to be 
resolved.

DoDi