#601 att.patternReplacement/@matchPattern should be XPath regex, not XML Schema

AMBER
closed
None
5(default)
2014-08-23
2013-09-25
No

The regular expression syntax for the @matchPattern attribute in att.patternReplacement is defined as W3C XML Schema regex (for which I'm responsible, IIRC, so mea culpa). Hugh Cayless points out that by default, XML Schema regexes are anchored at both ends, which is severely restrictive. Instead, I think we should redefine it as XPath syntax:

http://www.w3.org/TR/xpath-functions/#regex-syntax

which is based on XML Schema but adds a number of useful extras, including the availability of explicit anchors. I think this is just a corrigible error, and since the current definition is subsumed in the broader one I now propose, we can make this change with no detrimental effects.

Related

Bugs: #628

Discussion

  • Martin Holmes

    Martin Holmes - 2013-11-13

    Council 2013-11-13: MH will write to TEI-L to check whether anyone has actually depended on the limitations of the XML Schema version of regex; if not, implement this. Noted that processing in the Stylesheets is already being done with XSLT2, so is assuming XPath regex patterns.

     
  • Martin Holmes

    Martin Holmes - 2013-11-13
    • status: open --> open-accepted
    • Group: AMBER --> GREEN
     
  • Sebastian Rahtz

    Sebastian Rahtz - 2013-11-13
    • status: open-accepted --> closed
    • Group: GREEN --> AMBER
     
  • Sebastian Rahtz

    Sebastian Rahtz - 2013-11-13

    See feature-requests/432/. closing here