[ List Archives Home ] [ Thread index for 2008 ]
[ Date index for 2008 ]
[ Author index for 2008 ]
[
Date Prev][
Date Next][
Thread Prev][
Thread Next][
Date Index][
Thread Index]
Just to follow up on my question about Unicode. There is some setting
on my system that was not set. Innovative has set it and now inserting
Unicode character works. If you find yourself unable to insert or copy
and paste the Unicode, you know what to do now.
I love the help desk, and I love them even more when you do not have to
ask them.
Don Zhou
William Mitchell College of Law
-----Original Message-----
From: innopac-bounces at innopacusers dot org
[
mailto:innopac-bounces at innopacusers dot org] On Behalf Of Bob Rasmussen
Sent: Thursday, February 16, 2006 12:49 PM
To: IUG INNOPAC List
Subject: Re: [IUG] insert unicode problem
On Thu, 16 Feb 2006, Cliff Glaviano wrote:
> ...
> I suspect that Millennium unicode and OCLC unicode are not
identical ...
I'm working strictly based on Unicode theory here...
The Unicode encoding of accented characters provides two ways of storing
them. An A-acute, for instance, can be stored as Unicode C1 (often
denoted
as U+00A1), which is also its hex value in the Latin-1 set. This is the
"combined" form.
It can alternatively be stored as the "A" (U+0041), followed by a
combining acute (U+0301). This is the "decomposed" form.
Ideally, the two forms will display and print identically. And there are
Windows calls that a programmer can use to compose and decompose; that
is,
to convert between the two forms.
Some things to note:
1. A character such as those used by Vietnamese can have multiple
combining accents. It could exist in a "partially decomposed" state.
2. The combining accents are mostly in the U+03XX range.
3. Some character/accent combinations used in libraries do not exist in
combined form in Unicode, because they to not exist in actual languages.
Instead, they are used only for transliterations. The combining double
tilde, for instance (which is actually two characters), is not used in
Cyrillic, but is used for transliterations of Cyrillic.
Now because Innopac/Millennium's background is based on MARC, I believe
it
still uses the MARC character set (formerly ANSEL) to store data. I
suspect that it likes decomposed characters, not combined characters.
If someone wants to send me a Word document (off-list) that contains one
line of characters that WILL work and one line that WON'T work, I'll
diagnose it and post my results.
Regards,
....Bob Rasmussen, President, Rasmussen Software, Inc.
personal e-mail: ras at anzio dot com
company e-mail: rsi at anzio dot com
voice: (US) 503-624-0360 (9:00-6:00 Pacific Time)
fax: (US) 503-624-0760
web:
http://www.anzio.com
--
This message was distributed through the Innovative Users Group INNOPAC
list
Public replies: INNOPAC at innopacusers dot org
Update your subscription options:
http://innopacusers.org/mailman/listinfo/innopac
_______________________________________________________________________
This email has been scanned for all viruses.
_______________________________________________________________________
This email has been scanned for all viruses.