1. Purpose
2. PHP and character sets
3. Supported character sets and encodings
4. Copying
5. Examples
6. Requirements
7. See also
8. Downloading
Usage: charconv [-h] <incharset> <outcharset>
Reads stdin, outputs stdout.
Does incharset->outcharset conversion via unicode.
  -h = Input is html (THIS BUGS)
Available character sets/encodings:
 - unihtml (&#number; codes)
 - utf8linux (with vt100 escape codes)
 - utf7mod (imap modified)
 - koi8r
 - jis-x-0201
 - shift_jis
 - big5
 - iso-8859-1
 - iso-8859-2
 - iso-8859-3
 - iso-8859-4
 - iso-8859-5
 - iso-8859-6
 - iso-8859-7
 - iso-8859-8
 - iso-8859-9
 - iso-8859-10
 - iso-8859-13
 - iso-8859-14
 - iso-8859-15
 - cp437
 - cp737
 - cp775
 - cp850
 - cp852
 - cp855
 - cp857
 - cp860
 - cp861
 - cp862
 - cp863
 - cp864
 - cp865
 - cp866
 - cp869
 - cp874
 - cp1250
 - cp1252
 - cp1254
 - cp1256
 - cp1258
 - cp1251
 - cp1253
 - cp1255
 - cp1257
 - cp856
 - cp1006
 - cp424
 - roman
 - romanian
 - iso-2022-jp
 - utf8
 - utf7
 - euc-jp
Typos are allowed to some degree in the character set names,
and some general aliases like latin* and iso* are known.
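The two ideas in the usage text, converting "via unicode" and tolerating typos in character set names, can be sketched in a few lines of Python. This is only an illustration, not charconv's actual code: the alias table `CODECS`, the functions `resolve` and `convert`, and the use of `difflib` for fuzzy matching are all assumptions made for the example, and Python's built-in codecs stand in for charconv's own converters.

```python
import difflib
import sys

# Hypothetical alias table: charconv-style names -> Python codec names.
CODECS = {
    "utf8": "utf-8",
    "utf7": "utf-7",
    "koi8-r": "koi8-r",
    "shift_jis": "shift_jis",
    "euc-jp": "euc-jp",
    "iso-2022-jp": "iso-2022-jp",
    "iso-8859-1": "iso-8859-1",
    "latin1": "iso-8859-1",
    "cp1251": "cp1251",
}

def resolve(name: str) -> str:
    """Map a possibly misspelled charset name to a Python codec name."""
    key = name.lower()
    if key in CODECS:
        return CODECS[key]
    # Fuzzy match against the known names (an assumed strategy).
    guess = difflib.get_close_matches(key, list(CODECS), n=1, cutoff=0.6)
    if not guess:
        raise LookupError(f"unknown character set: {name}")
    print(f"Warning: assuming '{name}' means '{guess[0]}'", file=sys.stderr)
    return CODECS[guess[0]]

def convert(data: bytes, incharset: str, outcharset: str) -> bytes:
    """Convert bytes between charsets by pivoting through Unicode."""
    # Step 1: decode the input bytes into Unicode code points.
    text = data.decode(resolve(incharset))
    # Step 2: encode the code points in the target charset;
    # characters the target cannot represent become '?'.
    return text.encode(resolve(outcharset), errors="replace")
```

Pivoting through Unicode means each character set needs only one decoder and one encoder, instead of a separate converter for every pair of sets.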
If you want to make your own converter or just study how something works, you might still want to download this program. The package contains plain TXT files describing the character sets, and there are .cc files for each different encoding.
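For readers who want to build a converter from table files, here is a minimal Python sketch. The layout of the TXT files in the package is not described here, so the sketch assumes a hypothetical format resembling the Unicode Consortium mapping tables: one `0xBYTE 0xCODEPOINT` pair per line, with `#` starting a comment. The names `parse_table` and `decode_with_table` are illustrative and not part of charconv.

```python
def parse_table(lines):
    """Build a byte -> code point map from '0xNN 0xNNNN' lines (assumed format)."""
    table = {}
    for line in lines:
        line = line.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line:
            continue  # blank or comment-only line
        fields = line.split()
        if len(fields) < 2:
            continue  # byte listed without a mapping (undefined position)
        table[int(fields[0], 16)] = int(fields[1], 16)
    return table

def decode_with_table(data: bytes, table) -> str:
    """Decode single-byte data; unmapped bytes become U+FFFD."""
    return "".join(chr(table.get(b, 0xFFFD)) for b in data)

# Two made-up sample lines in the assumed format.
sample = [
    "0x41 0x0041 # LATIN CAPITAL LETTER A",
    "0xE4 0x00E4 # LATIN SMALL LETTER A WITH DIAERESIS",
]
table = parse_table(sample)
```

A table-driven decoder like this only covers single-byte character sets; multi-byte encodings such as shift_jis or iso-2022-jp need stateful logic on top of their tables.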
oktober:~/src/charconv$ echo 'Äiti tykkää oliiviöljystä'|charconv latin1 utf7
+AMQ-iti tykk+AOQA5A oliivi+APY-ljyst+AOQ
oktober:~/src/charconv$ echo '+AMQ-iti tykk+AOQA5A oliivi+APY-ljyst+AOQ'|charconv utf7 unihtml
Äiti tykkää oliiviöljystä
oktober:~/src/charconv$ echo 'pikachu' | sed -f /WWW/src/kr2k.sed | charconv sjis utf8
ć“ćć”ć
oktober:~/src/charconv$ echo -e '\33$B$P$+\33(B' | charconv iso-2022-jp unihmtl
Charconv: Warning: Assuming 'unihmtl' means 'unihtml'
ばか
oktober:~/src/charconv$ echo 'Ōčķė’ķäč’' | charconv cp1251 koi8r
Charconv: Warning: Assuming 'koi8r' means 'koi8-r'
ęÉĪĢŃĪÄÉŃ
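The UTF-7 and ISO-2022-JP examples above can be approximated with Python's standard codecs. This is a sketch under assumptions, not a reimplementation of charconv: the helper `to_unihtml` is a made-up name that imitates the `&#number;` output of the unihtml character set, and Python's codecs may differ from charconv in edge cases.

```python
def to_unihtml(text: str) -> str:
    """Write non-ASCII characters as decimal &#number; codes."""
    return text.encode("ascii", errors="xmlcharrefreplace").decode("ascii")

# UTF-7 input from the second example (echo appends the trailing newline).
utf7 = b"+AMQ-iti tykk+AOQA5A oliivi+APY-ljyst+AOQ\n"
finnish = utf7.decode("utf-7")
print(to_unihtml(finnish), end="")

# ISO-2022-JP input from the fourth example; \x1b is the ESC byte (\33 in octal).
jis = b"\x1b$B$P$+\x1b(B\n"
japanese = jis.decode("iso-2022-jp")
print(to_unihtml(japanese), end="")
```

Viewed in a browser, the printed `&#number;` codes render as the original characters.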
If you are converting HTML pages, use htmlrecode instead. It handles them (and changes the character set) losslessly.
Generated from progdesc.php (last updated: Mon, 2 Sep 2002 04:24:50 +0300) with docmaker.php (last updated: Tue, 13 Aug 2002 14:17:29 +0300) at Mon, 2 Sep 2002 04:24:56 +0300