There are a number of character which can never appear in a valid URL. These are defined in the RFC. The URL recognizer should know that the URL stops when one of these characters occurs.
The OffsiteLinkTranslator now stops on ',' and '.', which
are the most common problems in pages. If there are more
characters you'd like to point out, I'd be happy to add them.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
The patch submitted by AveryRegier has code in it to handle URL's within (), [], <> and ending with punctuation ,.
The patch submitted by AveryRegier has code in it to handle URL's within (), [], <> and ending with punctuation ,.
Logged In: YES
user_id=29848
The OffsiteLinkTranslator now stops on ',' and '.', which
are the most common problems in pages. If there are more
characters you'd like to point out, I'd be happy to add them.