Arabic needs tagging of glyphs as being `initial', `medial', `final',
and `isolated', as specified in the Unicode book. Since this is
identical for all fonts the OpenType designers have decided to make
this information not being part of the font itself.
I just had to struggle with this a little.
The ARABIC LETTER HEH (U+0647) is a letter with 4 glyph forms.
In Kurdish (written in the Sorani, essential arabic, alphabet)
one has two letters (let me call them Kurdish H and Kurdish E)
and these 4 glyph forms become the two forms of Kurdish H
and the two forms of Kurdish E.
Now these four glyphs are tagged with `initial', `medial', `final',
and `isolated', and that is correct if the glyphs are used to write
arabic, but incorrect when precisely the same glyphs are used
to write Kurdish.
I wonder what the correct way is to write Kurdish in Unicode
(without using language tagging).
Are new Unicode code points needed? Do these exist already?