Steven Sauder
2008-07-29 03:18:28 UTC
Hi all!
We?re a long-time user of OmniOrb with great success in our applications,
but something has recently come up which is causing problems for our
European customers. Our applications all speak the (full) Windows CP1252
(Windows Latin 1) character set, in which Microsoft has used the code point
0x80 to represent the Euro symbol (?). CP1252 and ISO-8859-1 are ?almost?
the same, except that CP1252 utilizes the 0x80 code point to represent the
Euro, where ISO-8859-1 leaves this code point blank.
After a bit of investigation, it seems that OmniOrb by default uses
ISO-8859-1 as the ?native? codeset, which I had thought would mean that the
Euro symbol (and a couple of other ?special? characters such as the
trademark symbol, and the ?curly? printers quotes), which are represented in
CP1252, but not in ISO-8859-1, could not be handled by OmniOrb using its
default codeset. However, digging into cs-8859-1.cc a little more, it looks
like the translation tables ARE passing 0x80 through to UCS as 0x0080, so
unless I?m reading this wrong, any OmniOrb-to-OmniOrb communications (on
Windows) should pass the (Windows-specific) Euro code point 0x80 through
without problem. Am I reading this right?
However, the difficulty arises because we have several CORBA components
which are written using the standard Java ORB, which (it appears) is not
providing the same amount of leeway with this symbol, and insists on
transmitting the Euro symbol in it?s ?true? UCS16 representation (0x20AC),
which OmniOrb?s codeset converters end up turning into a ??? when we receive
it on the Windows end.
Has anyone had any experience with this? From what I?ve read so far, it
seems the only viable solution would be to write our own NCS-C
implementation that handled the CP1252 Euro symbol (0x80) to Unicode
(0x20AC) and back-again conversion through the translation tables as is
currently happening in cs-8859-1.cc, is this correct?
Any help would be hugely appreciated!
Thanks
Steve.
--
Steve Sauder
Chief Technology Officer
North Plains Systems Corp.
510 Front Street West, 4th Floor
Toronto, ON
Canada M5V 3H3
P: (416) 345-1900 ext. 500
F: (416) 599-0808
W: http://www.northplains.com/
E: ***@northplains.com
Confidentiality Notice:
The information contained herein is confidential and proprietary to North
Plains Systems Corp. ("North Plains") and is intended for review by
authorized persons only. Except as may otherwise be agreed to in writing by
North Plains, any disclosure, circulation, release or use of the information
contained herein is strictly prohibited.
Upcoming Webinar:
Marketing Made Easy With Digital Asset Management
August 14th, 2008 ? 1:00PM EST (10:00AM PST)
Click to register:
http://www.northplains.com/news/newsItem.cfm?cms_news_id=191&cms_news_type_i
d=13
TUG 2008 Conference
September 8th & 9th, 2008
Click to register:
http://www.northplains.com/en/customer_portal/conference.cfm
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.omniorb-support.com/pipermail/omniorb-list/attachments/20080728/94f5c2ec/attachment.htm
We?re a long-time user of OmniOrb with great success in our applications,
but something has recently come up which is causing problems for our
European customers. Our applications all speak the (full) Windows CP1252
(Windows Latin 1) character set, in which Microsoft has used the code point
0x80 to represent the Euro symbol (?). CP1252 and ISO-8859-1 are ?almost?
the same, except that CP1252 utilizes the 0x80 code point to represent the
Euro, where ISO-8859-1 leaves this code point blank.
After a bit of investigation, it seems that OmniOrb by default uses
ISO-8859-1 as the ?native? codeset, which I had thought would mean that the
Euro symbol (and a couple of other ?special? characters such as the
trademark symbol, and the ?curly? printers quotes), which are represented in
CP1252, but not in ISO-8859-1, could not be handled by OmniOrb using its
default codeset. However, digging into cs-8859-1.cc a little more, it looks
like the translation tables ARE passing 0x80 through to UCS as 0x0080, so
unless I?m reading this wrong, any OmniOrb-to-OmniOrb communications (on
Windows) should pass the (Windows-specific) Euro code point 0x80 through
without problem. Am I reading this right?
However, the difficulty arises because we have several CORBA components
which are written using the standard Java ORB, which (it appears) is not
providing the same amount of leeway with this symbol, and insists on
transmitting the Euro symbol in it?s ?true? UCS16 representation (0x20AC),
which OmniOrb?s codeset converters end up turning into a ??? when we receive
it on the Windows end.
Has anyone had any experience with this? From what I?ve read so far, it
seems the only viable solution would be to write our own NCS-C
implementation that handled the CP1252 Euro symbol (0x80) to Unicode
(0x20AC) and back-again conversion through the translation tables as is
currently happening in cs-8859-1.cc, is this correct?
Any help would be hugely appreciated!
Thanks
Steve.
--
Steve Sauder
Chief Technology Officer
North Plains Systems Corp.
510 Front Street West, 4th Floor
Toronto, ON
Canada M5V 3H3
P: (416) 345-1900 ext. 500
F: (416) 599-0808
W: http://www.northplains.com/
E: ***@northplains.com
Confidentiality Notice:
The information contained herein is confidential and proprietary to North
Plains Systems Corp. ("North Plains") and is intended for review by
authorized persons only. Except as may otherwise be agreed to in writing by
North Plains, any disclosure, circulation, release or use of the information
contained herein is strictly prohibited.
Upcoming Webinar:
Marketing Made Easy With Digital Asset Management
August 14th, 2008 ? 1:00PM EST (10:00AM PST)
Click to register:
http://www.northplains.com/news/newsItem.cfm?cms_news_id=191&cms_news_type_i
d=13
TUG 2008 Conference
September 8th & 9th, 2008
Click to register:
http://www.northplains.com/en/customer_portal/conference.cfm
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.omniorb-support.com/pipermail/omniorb-list/attachments/20080728/94f5c2ec/attachment.htm