Jump to content

Windows-1257

From Wikipedia, the free encyclopedia
(Redirected from CP1257)
Windows-1257
MIME / IANAwindows-1257
Alias(es)cp1257 (Code page 1257)
Language(s)Estonian, Latvian, Lithuanian, Latgalian, (also supports Polish, Slovene, Swedish, Finnish, Norwegian, Danish, German, English, Māori, Rotokas, Hawaiian, Niuean, Samoan, Tokelauan, Tongan, Tuvaluan, Hepburn romanization/Japanese transliteration)
Created byMicrosoft
StandardLST 1590-3, WHATWG Encoding Standard
Classificationextended ASCII, Windows-125x
Other related encoding(s)IBM-922, ISO 8859-13, LST 1590-4

Windows-1257 (Windows Baltic) is an 8-bit, single-byte extended ASCII code page used to support the Estonian (which also used in Windows-1252), Latvian and Lithuanian languages under Microsoft Windows. In Lithuania, it is standardised as LST 1590-3, alongside a modified variant named LST 1590-4.[1][2]

The label Windows-1257 was registered with the IANA in 1996, citing a publication of the specification in 1995 and inclusion with pan-European versions of Windows 95.[3] The later ISO 8859-13 encoding (first published in 1998) is similar, but differs in reserving the range 0x80–9F for control characters, and accordingly locating certain quotation marks at codepoints 0xA1, 0xA5, 0xB4 and 0xFF instead (the latter two are used for spacing diacritics in Windows-1257). Windows-1257 is not compatible with the older ISO 8859-4 and ISO 8859-10 encodings. For the letters of the Estonian alphabet, Windows-1257 is compatible with IBM-922.

IBM uses code page 1257 (CCSID 1257, euro sign extended CCSID 5353, and the further extended CCSID 9449) for Windows-1257.[4][5][6][7]

As with many other code pages, the languages supported in this code page can be supported in other code pages. The Estonian language can be written with Windows-1252. It is possible, but unusual, to write Polish, Slovene, Swedish, Finnish, Norwegian, Danish and German using this code page. The German specific characters will be identical to those encoded in Windows-1252.

Unicode is preferred to Windows-1257 in modern applications.

Character set

[edit]

The following table shows Windows-1257. Each character is shown with its Unicode equivalent in the tooltip.

Windows-1257[8][9][10][11][12][13][14]
0 1 2 3 4 5 6 7 8 9 A B C D E F
0x NUL SOH STX ETX EOT ENQ ACK BEL BS HT LF VT FF CR SO SI
1x DLE DC1 DC2 DC3 DC4 NAK SYN ETB CAN EM SUB ESC FS GS RS US
2x  SP  ! " # $ % & ' ( ) * + , - . /
3x 0 1 2 3 4 5 6 7 8 9 : ; < = > ?
4x @ A B C D E F G H I J K L M N O
5x P Q R S T U V W X Y Z [ \ ] ^ _
6x ` a b c d e f g h i j k l m n o
7x p q r s t u v w x y z { | } ~ DEL
8x ¨ ˇ ¸
9x ¯ ˛
Ax NBSP ¢ £ ¤ ¦ § Ø © Ŗ « ¬ SHY ® Æ
Bx ° ± ² ³ ´ µ · ø ¹ ŗ » ¼ ½ ¾ æ
Cx Ą Į Ā Ć Ä Å Ę Ē Č É Ź Ė Ģ Ķ Ī Ļ
Dx Š Ń Ņ Ó Ō Õ Ö × Ų Ł Ś Ū Ü Ż Ž ß
Ex ą į ā ć ä å ę ē č é ź ė ģ ķ ī ļ
Fx š ń ņ ó ō õ ö ÷ ų ł ś ū ü ż ž ˙
  Unassigned
  Differences from Windows-1252

References

[edit]
  1. ^ "Rašmenų koduotės". Lietuvių kalba informacinėse technologijose.
  2. ^ "LST 1590-3". Rašmenų koduotės.
  3. ^ Lazhintseva, Katya (1996-05-03). "Registration of new MIME charset: Windows-1257". IANA.
  4. ^ "Code page 1257 information document". Archived from the original on 2016-03-03.
  5. ^ "CCSID 1257 information document". Archived from the original on 2016-03-26.
  6. ^ "CCSID 5353 information document". Archived from the original on 2014-11-29.
  7. ^ "CCSID 9449 information document". Archived from the original on 2014-11-29.
  8. ^ Unicode mapping table for Windows 1257
  9. ^ Unicode mappings of windows 1257 with "best fit"
  10. ^ Code Page CPGID 01257 (pdf) (PDF), IBM
  11. ^ Code Page CPGID 01257 (txt), IBM
  12. ^ International Components for Unicode (ICU), ibm-1257_P100-1995.ucm, 2002-12-03
  13. ^ International Components for Unicode (ICU), ibm-5353_P100-1998.ucm, 2002-12-03
  14. ^ International Components for Unicode (ICU), ibm-9449_P100-2002.ucm, 2003-10-07
[edit]