UTF-8 library

Ashley Yakeley ashley@semantic.org
Wed, 7 Aug 2002 03:14:08 -0700


At 2002-08-07 03:03, Axel Simon wrote:

>> Text encoded with ISO 8859-1 or UTF-8 is octets. If you want to use 
>> CChars, you should then subsequently convert the Word8s into CChars.
>Then I hope there is no C implementation where char is less than 8 bits 
>long.

There may or may not be. Squeezing in octets from ISO 8859-1 encoding is 
a problem you'll have to deal with either way, and is entirely separate 
from the text encoding process.

-- 
Ashley Yakeley, Seattle WA