support for 32-bit Unicode

Subject: support for 32-bit Unicode
From: Tomas Frydrych (tomas@frydrych.uklinux.net)
Date: Mon Feb 04 2002 - 04:02:40 CST

Next message: F J Franklin: "Re: support for 32-bit Unicode"
Previous message: Martin Sevior: "commit: fix 2654"
In reply to: Andrew Dunbar: "Re: from AbiWord to AbiSuite [was Re: A new draw on XP refactoring"
Next in thread: F J Franklin: "Re: support for 32-bit Unicode"
Next in thread: Paul Rohr: "AbiDraw UI (was: Re: A new draw on XP refactoring)"
Reply: Tomas Frydrych: "support for 32-bit Unicode"
Reply: F J Franklin: "Re: support for 32-bit Unicode"
Reply: Anthony Fok: "Re: support for 32-bit Unicode"
Reply: Anthony Fok: "Re: support for 32-bit Unicode (using UTF-16?)"
Reply: Martin Sevior: "Re: support for 32-bit Unicode"

> What we need to do is support the full 32-bit Unicode
> character set but we shouldn't use UTF-32 to do it
> since we'll waste vast amounts of memory space since
> characters above 16-bit are very very rare. We need
> to instead switch to UTF-8 internally for everything.
> This is the right answer for several reasons which
> have all been covered in depth on several mailing
> lists
Since the characters have a variable bit width, UTF-8 processing is
very CPU-intensive for everything but the basic 7-bit ASCII charset. It
is not meant to be used internally by applications; it is meant as
an encoding for communication between applications over 8-bit
channels. Internally we need to use a fixed-width encoding, so if we
want to support 32-bit Unicode, we have to redefine UT_UCSChar
to long.
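
To illustrate the variable-width point, here is a minimal sketch, not
AbiWord code. UCS4Char is a hypothetical stand-in for a 32-bit
UT_UCSChar, and nth_fixed / nth_utf8 are names made up for this example.
Reaching the n-th character of a fixed-width buffer is a single array
lookup, whereas in UTF-8 every preceding character has to be stepped
over first. The decoder assumes well-formed input.

```cpp
#include <cstddef>
#include <cstdint>
#include <string>
#include <vector>

// Hypothetical stand-in for a 32-bit UT_UCSChar (an assumption for this
// sketch, not AbiWord's actual definition).
using UCS4Char = std::uint32_t;

// Fixed width: the n-th character is a direct array lookup.
UCS4Char nth_fixed(const std::vector<UCS4Char>& text, std::size_t n) {
    return text.at(n);
}

// UTF-8: walk the buffer from the start, skipping one whole character at
// a time. Assumes well-formed UTF-8; a real decoder would also have to
// reject overlong forms, surrogates and truncated sequences.
// Throws std::out_of_range if n is past the end of the text.
UCS4Char nth_utf8(const std::string& text, std::size_t n) {
    std::size_t i = 0;
    for (;;) {
        const unsigned char lead = static_cast<unsigned char>(text.at(i));
        std::size_t len;
        UCS4Char cp;
        if (lead < 0x80) {                  // 0xxxxxxx: 1 byte (ASCII)
            len = 1; cp = lead;
        } else if ((lead & 0xE0) == 0xC0) { // 110xxxxx: 2 bytes
            len = 2; cp = lead & 0x1F;
        } else if ((lead & 0xF0) == 0xE0) { // 1110xxxx: 3 bytes
            len = 3; cp = lead & 0x0F;
        } else {                            // 11110xxx: 4 bytes
            len = 4; cp = lead & 0x07;
        }
        if (n == 0) {
            // Fold in the low six bits of each continuation byte.
            for (std::size_t k = 1; k < len; ++k) {
                const unsigned char c =
                    static_cast<unsigned char>(text.at(i + k));
                cp = (cp << 6) | (c & 0x3F);
            }
            return cp;
        }
        i += len;
        --n;
    }
}
```

Note that nth_utf8 is only slower for random access by character index.
Code that walks the text sequentially pays the decoding cost once per
character either way.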

I agree that having a 32-bit UT_UCSChar would waste a lot of memory, and
I would like to see a case made first for why we need to support 32-bit
Unicode.

Tomas


This archive was generated by hypermail 2b25 : Mon Feb 04 2002 - 04:04:23 CST