[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

UCData 2.1 released with Bidi algorithm



Version 2.1 of the UCData freeware package has been released.  This release
provides some bug fixes, and update for the new (apparently undocumented)
Unicode 3.0 bidirectional categories, and the addition of the "Pretty Good
Bidi Algorithm."

The PGBA is an elegant and simple one-pass bidi reordering algorithm that
works pretty dang good for most text.  It has some deliberate, but minor
shortcomings just so developers who use it have something to keep them
occupied :-) The PGBA is in no way related to the Unicode Bidi Algorithm
except by coincidence.

IMPORTANT: The PGBA is dependent on UCData because of the interpretation of
           certain 3.0 bidi categories.  To be explicit, the following
           bidi category assumptions are made when building the character type
           data file:

             "AL" is equivalent to the "R" property.

             "BM", "NSM", "LRE", "RLE", "LRO", "RLO", "PDF" are all equivalent
             to the "ON" property.

           If your character type package of preference has these assumptions,
           then using the PGBA will be no problem.

Short and simple info page:

  http://crl.nmsu.edu/~mleisher/ucdata.html

The distribution is available in .tar.gz and .zip form from:

  http://crl.nmsu.edu/~mleisher/ucdata-2.1.tar.gz
  http://crl.nmsu.edu/~mleisher/ucdata21.zip

  ftp://crl.nmsu.edu/CLR/multiling/unicode/ucdata.tar.gz
  ftp://crl.nmsu.edu/CLR/multiling/unicode/ucdata.zip

Changes
-------

  o  Updated for new Unicode 3.0 bidi property tags.

  o  Fixed some problems with signedness and comparison in the Java class.

  o  Added a bidi reordering algorithm.
-
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/lists/