[ List Archives Home ] [ Thread index for 2008 ]
[ Date index for 2008 ]
[ Author index for 2008 ]
[
Date Prev][
Date Next][
Thread Prev][
Thread Next][
Date Index][
Thread Index]
I can't remember if this has been posted, but here's the link for the
Innovative FAQ about this.
http://csdirect.iii.com/faq/diacrit.shtml
Sharon Knowlton
Support Systems Analyst, Sr.
Digital Library and Information Systems Team
University of Arizona Library
Tucson, AZ 85721-0055
Phone: (520) 307-2806
FAX: (520) 621-8276
Email: knowlton at u dot library dot arizona dot edu
"The opinions or statements expressed herein are my own and should not be
taken as a position, opinion, or endorsement of the University of Arizona."
-----Original Message-----
From: Bob Rasmussen [
mailto:ras at anzio dot com]
Sent: Thursday, December 02, 2004 10:52 PM
To: IUG INNOPAC List
Subject: RE: Unicode for all Webpacs
On Thu, 2 Dec 2004, Knowlton, Sharon wrote:
> Thanks to Sara for her response.
> I tested in our catalog and we have the same problem. For example the
> Russian title, Chetverta{235}i{236}a Vologda, displays in the Unicode
webpac
> as Chetverta︠i︡a Vologda.
> I've opened a call with Innovative to see if there is any new news, but in
> the meantime we will continue to run Unicode on a separate port.
For those who can't see how this plays out in your email, the problem is
that the combining "ligature" mark stretches over "ai", instead of over
"ia". The {235} is a combining ligature, which in MARC format would
combine with the NEXT character. When converted to Unicode, the {235}
should convert to a Unicode FE20 (which it does), but it needs to be sent
AFTER the "i".
This should not involve any recoding of your data (which is correct), or
even any systemic change of the III software. Instead, III needs to edit
the "diac map" file which is being used for the UTF-8 translation, to
properly deal with the "{235}i", the "{236}a", and any other combinations
that would involve the {235}, {236} and related characters.
I can see the same problem at other sites that have a UTF-8 option on
their telnet interface, using Anzio as the telnet client.
Regards,
....Bob Rasmussen, President, Rasmussen Software, Inc.
personal e-mail: ras at anzio dot com
company e-mail: rsi at anzio dot com
voice: (US) 503-624-0360 (9:00-6:00 Pacific Time)
fax: (US) 503-624-0760
web:
http://www.anzio.com
--
This message was distributed through the Innovative Users Group INNOPAC list
Public replies: INNOPAC at innopacusers dot org
Update your subscription options:
http://innopacusers.org/mailman/listinfo/innopac