s***@e-z.net
2014-11-30 21:15:30 UTC
Items for Xalan-C release testing
Confirm integration with ICU Library. According to previous
committers, the use of ICU was for XSLT locale dependent
collation (sorting) logic. Also with Xerces-C 3.x comes
transcoding support using ICU.
The ICU also provides transcoders for over 200 code pages.
We also need to clarify the maximum unicode codepoint handled
and what constitutes string lengths. Explicit transcoding to
UTF-32 using UTF-16 surrogate pairs expressing codepoints of
0x10000 to 0x10FFFF need to be evaluated.
Xalan-C library sources of specific interest come to mind.
PlatformSupport/XalanCollationServices.hpp
PlatformSupport/TranscodingServices.*
ICUBridge/*
XSLT/ElemSort.*
XSLT/NodeSorter.*
XSLT/NodeSortKey.*
I would like some help with test builds that integrate the
current version of the ICU Library (Release ICU_54) and
CLDR 26.
- - -
XalanTranscodingServices::getMaximumCharacterValue(theEncoding)
Does not return a value larger than 0xFFFF, the largest
16-bit binary value. Unicode allows up to 0x10FFFF as codepoint.
Encoding "UTF-32" therefore returns a maximum value of 0xFFFF.
When theEncoding is missing, a value of 0x7F is returned.
We need to identify and resolve the issues for unicode codepoint
values in the range of 0x10000 through 0x10FFFF.
Sincerely,
Steven J. Hathaway
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-***@xalan.apache.org
For additional commands, e-mail: dev-***@xalan.apache.org
Confirm integration with ICU Library. According to previous
committers, the use of ICU was for XSLT locale dependent
collation (sorting) logic. Also with Xerces-C 3.x comes
transcoding support using ICU.
The ICU also provides transcoders for over 200 code pages.
We also need to clarify the maximum unicode codepoint handled
and what constitutes string lengths. Explicit transcoding to
UTF-32 using UTF-16 surrogate pairs expressing codepoints of
0x10000 to 0x10FFFF need to be evaluated.
Xalan-C library sources of specific interest come to mind.
PlatformSupport/XalanCollationServices.hpp
PlatformSupport/TranscodingServices.*
ICUBridge/*
XSLT/ElemSort.*
XSLT/NodeSorter.*
XSLT/NodeSortKey.*
I would like some help with test builds that integrate the
current version of the ICU Library (Release ICU_54) and
CLDR 26.
- - -
XalanTranscodingServices::getMaximumCharacterValue(theEncoding)
Does not return a value larger than 0xFFFF, the largest
16-bit binary value. Unicode allows up to 0x10FFFF as codepoint.
Encoding "UTF-32" therefore returns a maximum value of 0xFFFF.
When theEncoding is missing, a value of 0x7F is returned.
We need to identify and resolve the issues for unicode codepoint
values in the range of 0x10000 through 0x10FFFF.
Sincerely,
Steven J. Hathaway
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-***@xalan.apache.org
For additional commands, e-mail: dev-***@xalan.apache.org