Encodings in Legacy Khmer TrueType Fonts (record no. 445012)
000 - LEADER | |
---|---|
fixed length control field | 02969cam a2200289 4500500 |
005 - DATE AND TIME OF LATEST TRANSACTION | |
control field | 20250121011621.0 |
041 ## - LANGUAGE CODE | |
Language code of text/sound track or separate title | fre |
042 ## - AUTHENTICATION CODE | |
Authentication code | dc |
100 10 - MAIN ENTRY--PERSONAL NAME | |
Personal name | Toshiya, Suzuki |
Relator term | author |
245 00 - TITLE STATEMENT | |
Title | Encodings in Legacy Khmer TrueType Fonts |
260 ## - PUBLICATION, DISTRIBUTION, ETC. | |
Date of publication, distribution, etc. | 2007. |
500 ## - GENERAL NOTE | |
General note | 21 |
520 ## - SUMMARY, ETC. | |
Summary, etc. | Despite the growing use of ISO standards for encoding most Indic scripts in South and South-East Asian countries, makeshift encodings are still used to avoid implementing complex text layout systems for Indic scripts. Because these encodings are not well defined and were developed ad hoc, they are very difficult to detect by deductive methods. As a result, text is most often published as an image. We take Khmer script as a characteristic example. We examined the various Khmer fonts distributed free of charge on the Web, the encodings they declare, and the encodings they actually use. We conclude that the declared encodings are unreliable. Based on the encodings studied, we propose a heuristic algorithm for detecting Khmer encodings. This algorithm also lets us transcode documents written in these encodings with higher accuracy than cognitive methods. |
520 ## - SUMMARY, ETC. | |
Summary, etc. | In spite of ISO standards for most Indic scripts used in South and South-East Asian countries, legacy encodings are still used to avoid implementing complex text layout systems for Indic scripts. Since legacy encodings for Indic scripts are not well defined and were designed ad hoc, it is almost impossible to detect the encoding by deductive methods. As a result, the coded text is often handled as image data rather than text. As a typical example of the confusion caused by non-standard legacy encodings, we take Khmer script. We collected the various free-of-charge legacy Khmer fonts distributed on the Web and investigated the encodings they declare and the encodings they actually use. We found the declared encodings to be unreliable. Based on the code charts obtained from our investigation, we propose a heuristic algorithm to detect the encoding used in legacy Khmer fonts. This algorithm enables us to extract text data from legacy coded text with an accuracy higher than that of cognitive methods. |
690 ## - LOCAL SUBJECT ADDED ENTRY--TOPICAL TERM (OCLC, RLIN) | |
Topical term or geographic name as entry element | échange d'information |
690 ## - LOCAL SUBJECT ADDED ENTRY--TOPICAL TERM (OCLC, RLIN) | |
Topical term or geographic name as entry element | Cambodgien |
690 ## - LOCAL SUBJECT ADDED ENTRY--TOPICAL TERM (OCLC, RLIN) | |
Topical term or geographic name as entry element | Unicode |
690 ## - LOCAL SUBJECT ADDED ENTRY--TOPICAL TERM (OCLC, RLIN) | |
Topical term or geographic name as entry element | fonte TrueType |
690 ## - LOCAL SUBJECT ADDED ENTRY--TOPICAL TERM (OCLC, RLIN) | |
Topical term or geographic name as entry element | TrueType Font |
690 ## - LOCAL SUBJECT ADDED ENTRY--TOPICAL TERM (OCLC, RLIN) | |
Topical term or geographic name as entry element | Unicode |
690 ## - LOCAL SUBJECT ADDED ENTRY--TOPICAL TERM (OCLC, RLIN) | |
Topical term or geographic name as entry element | Khmer |
690 ## - LOCAL SUBJECT ADDED ENTRY--TOPICAL TERM (OCLC, RLIN) | |
Topical term or geographic name as entry element | Information Interchange |
700 10 - ADDED ENTRY--PERSONAL NAME | |
Personal name | Yamato, Masatake |
Relator term | author |
700 10 - ADDED ENTRY--PERSONAL NAME | |
Personal name | Mikami, Yoshiki |
Relator term | author |
786 0# - DATA SOURCE ENTRY | |
Note | Document numérique, vol. 9, no. 3, 2007-09-10, p. 45-68, ISSN 1279-5127 |
856 41 - ELECTRONIC LOCATION AND ACCESS | |
Uniform Resource Identifier | https://shs.cairn.info/revue-document-numerique-2006-3-page-45?lang=fr&redirect-ssocas=7080 |
No copies available.