Mailing List Archive: 49091 messages
  • Home
  • Script library
  • AltME Archive
  • Mailing list
  • Articles Index
  • Site search
 

[REBOL] Supported character sets?

From: gavin::mckenzie::home::com at: 23-Sep-2000 11:40

First, I want to thank everybody who provided responses to my question wrt a complement charset. Next issue...one that I think I asked a year ago and I either a) can't find the answer or b) the answer wasn't clear. I'll greatly appreciate any help I can get with the questions below. What character set (or sets) do Rebol characters and strings belong to? Given Rebol's cross platform nature I'm betting that it depends somewhat on the platform. Is there a default character set? There doesn't appear to be any immediate evidence that I can construct characters beyond the 255 boundary. So here's some scenarios: - can I construct strings encoded in UTF-8 and/or UTF-16 - can I construct strings of UCS-2 or UCS-4 characters - does the REBOL i/o support reading UTF-8/16? If I'm on a WinNT box I live in a world occupied by both Unicode (really UCS-2) and a currently selected ANSI codepage like Latin-1 or Shift-JIS. So imagine my currently selected ANSI codepage is Latin-1 -- will Rebol read in chars in Latin-1 or a Unicode based encoding? Sorry for all the questions but I build apps that handle global data and I'm struggling to figure out whether I can use Rebol for non-ASCII data. Working for a commercial software firm I've had first hand experience building cross-platform products that can handle many different encoded character streams; I know it isn't child's play, but in a world of global commerce it's becoming a necessity. Best Regards, Gavin.