
Re: Pros and cons of the following NNTP clients



Andreas Pakulat <apaku@gmx.de> writes:

> (BTW: It can handle UTF-8, right?)

Of course. At the mail/news level and for display, as well as when
working with files. In other words, here is the list of encodings
supported on my system:


###############################################
# List of coding systems in the following format:
# MNEMONIC-LETTER -- CODING-SYSTEM-NAME
#   DOC-STRING
1 -- iso-latin-1 (alias: iso-8859-1 latin-1)
  ISO 2022 based 8-bit encoding for Latin-1 (MIME:ISO-8859-1).
B -- chinese-big5 (alias: big5 cn-big5 cp950)
  BIG5 8-bit encoding for Chinese (MIME:Big5).
J -- iso-2022-jp (alias: junet)
  ISO 2022 based 7bit encoding for Japanese (MIME:ISO-2022-JP).
S -- japanese-shift-jis (alias: shift_jis sjis cp932)
  Shift-JIS 8-bit encoding for Japanese (MIME:SHIFT_JIS).
u -- mule-utf-8 (alias: utf-8)
  UTF-8 encoding for Emacs-supported Unicode characters.
0 -- iso-latin-9 (alias: iso-8859-15 latin-9 latin-0)
  ISO 2022 based 8-bit encoding for Latin-9 (MIME:ISO-8859-15).
2 -- iso-latin-2 (alias: iso-8859-2 latin-2)
  ISO 2022 based 8-bit encoding for Latin-2 (MIME:ISO-8859-2).
3 -- iso-latin-3 (alias: iso-8859-3 latin-3)
  ISO 2022 based 8-bit encoding for Latin-3 (MIME:ISO-8859-3).
4 -- iso-latin-4 (alias: iso-8859-4 latin-4)
  ISO 2022 based 8-bit encoding for Latin-4 (MIME:ISO-8859-4).
5 -- cyrillic-iso-8bit (alias: iso-8859-5)
  ISO 2022 based 8-bit encoding for Cyrillic script (MIME:ISO-8859-5).
7 -- greek-iso-8bit (alias: iso-8859-7)
  ISO 2022 based 8-bit encoding for Greek (MIME:ISO-8859-7).
8 -- hebrew-iso-8bit (alias: iso-8859-8 iso-8859-8-e iso-8859-8-i)
  ISO 2022 based 8-bit encoding for Hebrew (MIME:ISO-8859-8).
9 -- iso-latin-5 (alias: iso-8859-9 latin-5)
  ISO 2022 based 8-bit encoding for Latin-5 (MIME:ISO-8859-9).
c -- chinese-iso-8bit (alias: cn-gb-2312 euc-china euc-cn cn-gb gb2312 cp936)
  ISO 2022 based EUC encoding for Chinese GB2312 (MIME:GB2312).
E -- japanese-iso-8bit (alias: euc-japan-1990 euc-japan euc-jp)
  ISO 2022 based EUC encoding for Japanese (MIME:EUC-JP).
K -- korean-iso-8bit (alias: euc-kr euc-korea cp949)
  ISO 2022 based EUC encoding for Korean KSC5601 (MIME:EUC-KR).
T -- thai-tis620 (alias: th-tis620 tis620 tis-620)
  8-bit encoding for ASCII (MSB=0) and Thai TIS620 (MSB=1).
W -- iso-latin-8 (alias: iso-8859-14 latin-8)
  ISO 2022 based 8-bit encoding for Latin-8 (MIME:ISO-8859-14).
* -- windows-1252
  windows-1252 encoding
b -- windows-1251 (alias: cp1251)
  windows-1251 encoding
M -- mac-roman
  Mac Roman Encoding (MIME:MACINTOSH).
R -- cyrillic-koi8 (alias: koi8-r koi8 cp878)
  KOI8-R 8-bit encoding for Cyrillic (MIME: KOI8-R).
U -- koi8-u
  KOI8-U 8-bit encoding for Cyrillic (MIME: KOI8-U)
V -- vietnamese-viscii (alias: viscii)
  8-bit encoding for Vietnamese VISCII 1.1 (MIME:VISCII)
z -- chinese-hz (alias: hz-gb-2312 hz)
  Hz/ZW 7-bit encoding for Chinese GB2312 (MIME:HZ-GB-2312).
C -- iso-2022-cn-ext
  ISO 2022 based 7bit encoding for Chinese GB and CNS (MIME:ISO-2022-CN-EXT).
C -- iso-2022-cn (alias: chinese-iso-7bit)
  ISO 2022 based 7bit encoding for Chinese GB and CNS (MIME:ISO-2022-CN).
J -- iso-2022-jp-2
  ISO 2022 based 7bit encoding for CJK, Latin-1, and Greek (MIME:ISO-2022-JP-2).
k -- iso-2022-kr (alias: korean-iso-7bit-lock)
  ISO 2022 based 7-bit encoding for Korean KSC5601 (MIME:ISO-2022-KR).
u -- mule-utf-16be-with-signature (alias: utf-16be-with-signature mule-utf-16-be utf-16-be)
  Big endian UTF-16 (with BOM) for Emacs-supported Unicode characters.
u -- mule-utf-16le-with-signature (alias: utf-16le-with-signature mule-utf-16-le utf-16-le)
  Little endian UTF-16 (with BOM) for Emacs-supported Unicode characters.
u -- mule-utf-16 (alias: utf-16)
  UTF-16 (with or without BOM) for Emacs-supported Unicode characters.
u -- mule-utf-16be (alias: utf-16be)
  UTF-16BE encoding for Emacs-supported Unicode characters.
u -- mule-utf-16le (alias: utf-16le)
  UTF-16LE encoding for Emacs-supported Unicode characters.
x -- compound-text (alias: x-ctext ctext)
  Compound text based generic encoding for decoding unknown messages.
= -- emacs-mule
  Emacs internal format used in buffer and string.
= -- no-conversion
  Do no conversion.
J -- iso-2022-7bit
  ISO 2022 based 7-bit encoding using only G0
t -- raw-text
  Raw text, which means text contains random 8-bit codes.
& -- iso-2022-7bit-lock (alias: iso-2022-int-1)
  ISO-2022 coding system using Locking-Shift for 96-charset
@ -- iso-2022-8bit-ss2
  ISO 2022 based 8-bit encoding using SS2 for 96-charset
- -- iso-safe (alias: us-ascii)
  Encode ASCII asis and encode non-ASCII characters to `?'.
D -- in-is13194 (alias: devanagari)
  8-bit encoding for ASCII (MSB=0) and IS13194-Devanagari (MSB=1).
L -- lao
  8-bit encoding for ASCII (MSB=0) and LAO (MSB=1).
Q -- tibetan-iso-8bit (alias: tibetan)
  8-bit encoding for ASCII (MSB=0) and TIBETAN (MSB=1).
- -- undecided
  No conversion on encoding, automatic conversion on decoding
A -- cyrillic-alternativnyj (alias: alternativnyj)
  ALTERNATIVNYJ 8-bit encoding for Cyrillic.
j -- japanese-iso-7bit-1978-irv (alias: iso-2022-jp-1978-irv old-jis)
  ISO 2022 based 7-bit encoding for Japanese JISX0208-1978 and JISX0201-Roman.
q -- vietnamese-viqr (alias: viqr)
  Vietnamese latin transcription (VIQR)
t -- vietnamese-tcvn (alias: tcvn)
  8-bit encoding for Vietnamese TCVN-5712
v -- vietnamese-vscii (alias: vscii)
  8-bit encoding for Vietnamese VSCII-1
x -- compound-text-with-extensions (alias: x-ctext-with-extensions ctext-with-extensions)
  Compound text encoding with extended segments.
Z -- euc-tw (alias: euc-taiwan)
  ISO 2022 based EUC encoding for Chinese CNS11643.
$ -- iso-2022-7bit-ss2
  ISO 2022 based 7-bit encoding using SS2 for 96-charset
i -- iso-2022-7bit-lock-ss2 (alias: iso-2022-cjk)
  Mixture of ISO-2022-JP, ISO-2022-KR, and ISO-2022-CN
x -- ctext-no-compositions
  Compound text based generic encoding for decoding unknown messages.
0 -- iso-latin-9-with-esc
  Same as iso-latin-9 but can handle any charsets by ISO's escape sequences.
1 -- iso-latin-1-with-esc
  Same as iso-latin-1 but can handle any charsets by ISO's escape sequences.
2 -- iso-latin-2-with-esc
  Same as iso-latin-2 but can handle any charsets by ISO's escape sequences.
3 -- iso-latin-3-with-esc
  Same as iso-latin-3 but can handle any charsets by ISO's escape sequences.
4 -- iso-latin-4-with-esc
  Same as iso-latin-4 but can handle any charsets by ISO's escape sequences.
5 -- cyrillic-iso-8bit-with-esc
  Same as cyrillic-iso-8bit but can handle any charsets by ISO's escape sequences.
7 -- greek-iso-8bit-with-esc
  Same as greek-iso-8bit but can handle any charsets by ISO's escape sequences.
8 -- hebrew-iso-8bit-with-esc
  Same as hebrew-iso-8bit but can handle any charsets by ISO's escape sequences.
9 -- iso-latin-5-with-esc
  Same as iso-latin-5 but can handle any charsets by ISO's escape sequences.
c -- chinese-iso-8bit-with-esc
  Same as chinese-iso-8bit but can handle any charsets by ISO's escape sequences.
D -- in-is13194-with-esc
  Same as in-is13194 but can handle any charsets by ISO's escape sequences.
K -- korean-iso-8bit-with-esc
  Same as korean-iso-8bit but can handle any charsets by ISO's escape sequences.
L -- lao-with-esc
  Same as lao but can handle any charsets by ISO's escape sequences.
Q -- tibetan-iso-8bit-with-esc
  Same as tibetan-iso-8bit but can handle any charsets by ISO's escape sequences.
T -- thai-tis620-with-esc
  Same as thai-tis620 but can handle any charsets by ISO's escape sequences.
W -- iso-latin-8-with-esc
  Same as iso-latin-8 but can handle any charsets by ISO's escape sequences.
E -- japanese-iso-8bit-with-esc
  Same as japanese-iso-8bit but can handle any charsets by ISO's escape sequences.
####################################################
# The following coding systems are not yet loaded. #
####################################################
iso-8859-11
pt154
georgian-ps
iso-8859-13
iso-8859-10
iso-8859-16
windows-1258
windows-1257
windows-1256
windows-1255
windows-1254
windows-1253
windows-1250
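
The listing above follows the two-line format named in its own header
(MNEMONIC-LETTER -- CODING-SYSTEM-NAME, then an indented DOC-STRING). As a
rough sketch, assuming that format holds for every entry, the text could be
parsed to look up which coding system an alias such as utf-8 belongs to. The
function names and the SAMPLE excerpt are illustrative only and are not part
of Emacs:

```python
import re

# Assumed entry shape, taken from the header of the listing:
#   MNEMONIC-LETTER -- CODING-SYSTEM-NAME (alias: A B C)
ENTRY = re.compile(r"^(\S) -- (\S+)(?: \(alias: ([^)]*)\))?$")


def parse_coding_systems(text):
    """Return {name: {"mnemonic": str, "aliases": [str], "doc": str}}."""
    systems = {}
    current = None
    for line in text.splitlines():
        if line.startswith("#"):
            continue  # comment and banner lines
        match = ENTRY.match(line)
        if match:
            mnemonic, name, aliases = match.groups()
            current = {
                "mnemonic": mnemonic,
                "aliases": (aliases or "").split(),
                "doc": "",
            }
            systems[name] = current
        elif line.startswith("  ") and current is not None:
            current["doc"] = line.strip()  # indented doc string
    return systems


def find_by_alias(systems, alias):
    """Return the coding system name that owns `alias`, or None."""
    for name, info in systems.items():
        if alias == name or alias in info["aliases"]:
            return name
    return None


# Short excerpt of the listing, used only for demonstration.
SAMPLE = """\
# MNEMONIC-LETTER -- CODING-SYSTEM-NAME
1 -- iso-latin-1 (alias: iso-8859-1 latin-1)
  ISO 2022 based 8-bit encoding for Latin-1 (MIME:ISO-8859-1).
u -- mule-utf-8 (alias: utf-8)
  UTF-8 encoding for Emacs-supported Unicode characters.
* -- windows-1252
  windows-1252 encoding
"""

systems = parse_coding_systems(SAMPLE)
print(find_by_alias(systems, "utf-8"))
```

Names in the "not yet loaded" part of the listing carry no mnemonic and are
skipped by this sketch.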


and here are the supported character sets:


ID-NUM  CHARSET-NAME             MULTIBYTE-FORM  D CH FINAL-CHAR
------  ------------             --------------  - -- ----------
000(00) ascii                    xx              1 94 B
128(80) eight-bit-graphic        XX              1 96 none
129(81) latin-iso8859-1          81 XX           1 96 A
130(82) latin-iso8859-2          82 XX           1 96 B
131(83) latin-iso8859-3          83 XX           1 96 C
132(84) latin-iso8859-4          84 XX           1 96 D
133(85) thai-tis620              85 XX           1 96 T
134(86) greek-iso8859-7          86 XX           1 96 F
135(87) arabic-iso8859-6         87 XX           1 96 G
136(88) hebrew-iso8859-8         88 XX           1 96 H
137(89) katakana-jisx0201        89 XX           1 94 I
138(8A) latin-jisx0201           8A XX           1 94 J
140(8C) cyrillic-iso8859-5       8C XX           1 96 L
141(8D) latin-iso8859-9          8D XX           1 96 M
142(8E) latin-iso8859-15         8E XX           1 96 b
143(8F) latin-iso8859-14         8F XX           1 96 _
144(90) japanese-jisx0208-1978   90 XX XX        2 94 @
145(91) chinese-gb2312           91 XX XX        2 94 A
146(92) japanese-jisx0208        92 XX XX        2 94 B
147(93) korean-ksc5601           93 XX XX        2 94 C
148(94) japanese-jisx0212        94 XX XX        2 94 D
149(95) chinese-cns11643-1       95 XX XX        2 94 G
150(96) chinese-cns11643-2       96 XX XX        2 94 H
151(97) japanese-jisx0213-1      97 XX XX        2 94 O
152(98) chinese-big5-1           98 XX XX        2 94 0
153(99) chinese-big5-2           99 XX XX        2 94 1
158(9E) eight-bit-control        9E Xx           1 96 none
160(A0) chinese-sisheng          9A A0 XX        1 94 0
161(A1) ipa                      9A A1 XX        1 96 0
162(A2) vietnamese-viscii-lower  9A A2 XX        1 96 1
163(A3) vietnamese-viscii-upper  9A A3 XX        1 96 2
164(A4) arabic-digit             9A A4 XX        1 94 2
165(A5) arabic-1-column          9A A5 XX        1 94 3
166(A6) ascii-right-to-left      9A A6 XX        1 94 B
167(A7) lao                      9A A7 XX        1 94 1
224(E0) arabic-2-column          9B E0 XX        1 94 4
225(E1) indian-is13194           9B E1 XX        1 94 5
240(F0) indian-glyph             9C F0 XX XX     2 96 4
241(F1) tibetan-1-column         9C F1 XX XX     2 94 8
242(F2) mule-unicode-2500-33ff   9C F2 XX XX     2 96 2
243(F3) mule-unicode-e000-ffff   9C F3 XX XX     2 96 3
244(F4) mule-unicode-0100-24ff   9C F4 XX XX     2 96 1
245(F5) ethiopic                 9D F5 XX XX     2 94 3
246(F6) chinese-cns11643-3       9D F6 XX XX     2 94 I
247(F7) chinese-cns11643-4       9D F7 XX XX     2 94 J
248(F8) chinese-cns11643-5       9D F8 XX XX     2 94 K
249(F9) chinese-cns11643-6       9D F9 XX XX     2 94 L
250(FA) chinese-cns11643-7       9D FA XX XX     2 94 M
251(FB) indian-2-column          9D FB XX XX     2 94 5
252(FC) tibetan                  9D FC XX XX     2 94 7
254(FE) japanese-jisx0213-2      9D FE XX XX     2 94 P
------------------------------------------------------------------------
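
The MULTIBYTE-FORM column above suggests that the leading byte alone tells
how many bytes one character occupies in the internal representation. The
following sketch only reads the byte ranges off that column; it is an
illustration under that assumption, not Emacs code, and the names
multibyte_length and split_characters are made up for this example:

```python
def multibyte_length(lead):
    """Guess the byte length of one character from its leading byte.

    The ranges mirror the MULTIBYTE-FORM column of the table above.
    """
    if lead < 0x80:
        return 1  # ascii: the byte stands for itself
    if 0x81 <= lead <= 0x8F:
        return 2  # "81 XX" .. "8F XX"
    if 0x90 <= lead <= 0x99:
        return 3  # "90 XX XX" .. "99 XX XX"
    if lead in (0x9A, 0x9B):
        return 3  # "9A A0 XX", "9B E0 XX": second byte selects the charset
    if lead in (0x9C, 0x9D):
        return 4  # "9C F0 XX XX", "9D F5 XX XX"
    if lead == 0x9E:
        return 2  # eight-bit-control: "9E Xx"
    # 0x80, 0x9F and 0xA0..0xFF do not start any multi-byte form listed above.
    raise ValueError("not a leading byte in the table: 0x%02X" % lead)


def split_characters(data):
    """Split a byte string into per-character chunks using the table ranges."""
    chunks = []
    i = 0
    while i < len(data):
        n = multibyte_length(data[i])
        chunks.append(data[i:i + n])
        i += n
    return chunks


# Hypothetical byte sequence: one ASCII byte, then a 2-, 3- and 4-byte form.
sample = bytes([0x41, 0x81, 0xE4, 0x92, 0xA4, 0xA2, 0x9C, 0xF4, 0xA0, 0xA1])
for chunk in split_characters(sample):
    print(chunk.hex(" "))
```

Single bytes of the eight-bit-graphic set (listed as a bare XX) fall into the
same range as trailing bytes, so the sketch rejects them as leading bytes
rather than guessing.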

INDIRECTLY SUPPORTED CHARSETS SETS:

CHARSET NAME	MAPPED TO
------------	---------
windows-1252    
windows-1251    
mac-roman       ascii latin-iso8859-1 mule-unicode-2500-33ff 
                mule-unicode-0100-24ff mule-unicode-e000-ffff 
viscii          ascii vietnamese-viscii-lower vietnamese-viscii-upper 
vietnamese-tcvn ascii vietnamese-viscii-lower vietnamese-viscii-upper 
koi8-r          ascii cyrillic-iso8859-5 
alternativnyj   ascii cyrillic-iso8859-5 
koi8-u          ascii cyrillic-iso8859-5 mule-unicode-0100-24ff 
big5            ascii chinese-big5-1 chinese-big5-2 
sjis            ascii katakana-jisx0201 japanese-jisx0208 
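
The mapping table above lists koi8-r as mapped to ascii and
cyrillic-iso8859-5, which suggests that both encodings describe the same
Cyrillic letters and differ only in the byte values they assign. A small
sketch of that idea, using Python's own codec names (koi8_r, iso8859_5)
rather than anything from Emacs, and a sample word chosen only for
illustration:

```python
def encode_all(text, codec_names):
    """Encode `text` once per codec and return {codec name: bytes}."""
    return {name: text.encode(name) for name in codec_names}


# Sample Cyrillic word; any text within both repertoires would do.
word = "Привет"
encoded = encode_all(word, ["koi8_r", "iso8859_5"])

for name, raw in encoded.items():
    # Same characters, different bytes on the wire.
    print(name, raw.hex(" "), "->", raw.decode(name))
```

Decoding each byte string with its own codec yields the same text again,
which is why one internal character set can serve both encodings.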


If I am correctly informed, all of this even works under Windows.

Regards, Bruno.


