This is the mail archive of the glibc-bugs@sourceware.org mailing list for the glibc project.



[Bug localedata/12031] iconv -t ascii//translit with Greek characters


http://sourceware.org/bugzilla/show_bug.cgi?id=12031

squarious at gmail dot com changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |squarious at gmail dot com

--- Comment #2 from squarious at gmail dot com 2011-06-26 15:17:00 UTC ---
In general, Greek transliteration (Greeklish) can be either orthographically
correct or merely phonetically correct.

Standards [1][3]
=======
[ISO 843]: orthographically correct. It provides a complete transliteration
map from Greek characters to Latin ones, including accented letters and
digraphs.
http://en.wikipedia.org/wiki/ISO_843

Examples using ISO 843:
Ψάρι : Psári   (notice 1 letter Ψ -> 2 letters Ps)
Όργανο : Órgano
Φλάουτο : Fláouto (notice the exception for the double vowel ου -> ou)
Φύτρο : Fýtro
Αυτοκίνητο : Autokínīto
Αυγό : Augó
Φεγγάρι : Feggári
Μπάμια : Mpámia
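
The mapping behind these examples can be sketched in a few lines of Python.
This is only an illustration, not the ISO 843 table itself: the dictionary
holds just the letters that occur in the words above, the "u after α or ο"
rule is inferred from the Fláouto/Augó examples, and the names
ISO843_LETTERS and translit_orthographic are made up for this sketch.

```python
import unicodedata

# Partial table (assumption): only the letters used in the examples above.
ISO843_LETTERS = {
    "α": "a", "γ": "g", "ε": "e", "η": "ī", "ι": "i", "κ": "k",
    "λ": "l", "μ": "m", "ν": "n", "ο": "o", "π": "p", "ρ": "r",
    "τ": "t", "υ": "y", "φ": "f", "ψ": "ps",
}

def translit_orthographic(word):
    """Letter-by-letter transliteration that keeps the accent marks."""
    out = []
    prev = ""
    # NFD splits e.g. "ά" into "α" + combining acute (U+0301).
    for ch in unicodedata.normalize("NFD", word):
        if unicodedata.combining(ch):
            out.append(ch)  # re-attach the accent to the letter just emitted
            continue
        low = ch.lower()
        if low == "υ" and prev in ("α", "ο"):
            latin = "u"  # digraph exception: αυ -> au, ου -> ou
        else:
            latin = ISO843_LETTERS.get(low, ch)  # unknown characters pass through
        if ch != low:
            latin = latin.capitalize()  # Ψ -> Ps
        out.append(latin)
        prev = low
    # NFC recombines "a" + U+0301 into "á" where a precomposed form exists.
    return unicodedata.normalize("NFC", "".join(out))

for greek in ["Ψάρι", "Όργανο", "Φλάουτο", "Φύτρο", "Αυτοκίνητο", "Μπάμια"]:
    print(greek, ":", translit_orthographic(greek))
```

Since every Greek letter maps independently of pronunciation, the output
stays close to the Greek spelling.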

----
[UN/ELOT 743]: much like ISO 843. The key difference is that it has different
rules for consonant and vowel digraphs, which makes them phonetically correct.
It also uses more mainstream Latin letters, such as "i" instead of the "ī"
used by ISO 843.

Αυτοκίνητο : Aftokínito (notice that αυ becomes af to be phonetically correct)
Αυγό : Avgo (notice that here αυ becomes av to be phonetically correct)
Φεγγάρι : Fengári (notice that γγ becomes ng to be phonetically correct)
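
The context-dependent digraph rules can be sketched like this. It is a rough
illustration under stated assumptions, not the ELOT 743 table: the letter map
is partial, the af/av choice uses the usual pronunciation rule (af before a
voiceless consonant, av otherwise), an accent written on the υ of αυ is not
moved, and the names VOICELESS, LETTERS and translit_phonetic are invented
here.

```python
import unicodedata

VOICELESS = set("θκξπστφχψ")  # αυ sounds like "af" before these consonants

# Partial table (assumption): only the letters used in the examples above.
LETTERS = {
    "α": "a", "γ": "g", "ε": "e", "η": "i", "ι": "i", "κ": "k",
    "ν": "n", "ο": "o", "ρ": "r", "τ": "t", "υ": "y", "φ": "f",
}

def translit_phonetic(word):
    """Transliterate with look-behind/look-ahead for the αυ and γγ digraphs."""
    # Group each base letter with its combining marks: [base, marks].
    units = []
    for ch in unicodedata.normalize("NFD", word):
        if unicodedata.combining(ch) and units:
            units[-1][1] += ch
        else:
            units.append([ch, ""])

    out = []
    for i, (base, marks) in enumerate(units):
        low = base.lower()
        prev = units[i - 1][0].lower() if i > 0 else ""
        nxt = units[i + 1][0].lower() if i + 1 < len(units) else ""
        if low == "υ" and prev == "α":
            latin = "f" if nxt in VOICELESS else "v"  # αυ -> af / av
        elif low == "γ" and nxt == "γ":
            latin = "n"  # γγ -> ng (the second γ maps to "g" as usual)
        else:
            latin = LETTERS.get(low, base)  # unknown characters pass through
        if base != low:
            latin = latin.capitalize()
        out.append(latin + marks)
    return unicodedata.normalize("NFC", "".join(out))

print(translit_phonetic("Αυτοκίνητο"))
print(translit_phonetic("Φεγγάρι"))
```

Unlike a plain per-letter table, the result for υ and γ depends on the
neighbouring letters, which is what makes this style awkward to express as a
simple one-to-one conversion map.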

-----
[ALA-LC]: http://www.loc.gov/catdir/cpso/romanization/greek.pdf

Everyday Greeklish
===========
In everyday usage (SMS, IM, forums etc.) no one uses accented Greeklish.
There is no standard conversion table, and most people use a transliteration
that is only phonetically correct and sometimes visually correct.

Ψάρι : Psari
Όργανο : Organo
Φλάουτο : Flaouto, Flauto (the "o" can be dropped because "ou" sounds like
"u")
Φύτρο : Fytro, Fitro
Αυτοκίνητο : Aftokinito, Aytokinhto, Aftokinito
Αυγό : Avgo, Aygo
Φεγγάρι : Feggari, Fegari
Θα έρθω : Tha erthw, Tha ertho, 8a er8w, 8a er8o  ("8" is often used for "θ"
because the two look somewhat alike, i.e. visually correct)
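
Several of the everyday forms above are simply the accented transliterations
with the diacritics dropped. A minimal sketch of that step follows; the
function name strip_accents is made up for this example, and it only covers
accent removal, not the phonetic or visual substitutions.

```python
import unicodedata

def strip_accents(text):
    """Drop combining marks so that e.g. "Psári" becomes plain "Psari"."""
    # NFD separates each letter from its accent (á -> a + U+0301),
    # after which the combining marks can simply be filtered out.
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(c for c in decomposed if not unicodedata.combining(c))

for accented in ["Psári", "Órgano", "Fláouto", "Fýtro", "Aftokínito"]:
    print(accented, "->", strip_accents(accented))
```

The result can be typed on a plain US/English keyboard layout.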

Personal opinion
==========
Accents are not used because people usually have a Greek and a US/English
keyboard layout, and the US/English layout has no accents. Idioms like θ -> 8,
which try to mimic the shape of letters, can be confusing, so they can be
skipped. What I would like to see is a transliteration that users can also
type themselves. E.g. ISO 843 is not a good one because of η => ī: I have no
clue which keyboard layout I would need to type this accented i!?
Of course, if it is possible to support multiple systems, that would be best
for everyone.

In any case, there are many higher-level frameworks that need transliteration,
and it is very annoying to have to special-case Greek if you are based on
iconv. Any solution is welcome :)

Please ask if you need more information.

Info - References
----------------------------
[1] http://transliteration.eki.ee/pdf/Greek.pdf
[2] http://en.wikipedia.org/wiki/Greeklish
[3] http://en.wikipedia.org/wiki/Romanization_of_Greek


