1*0Sstevel@tonic-gatepackage Encode::KR; 2*0Sstevel@tonic-gateBEGIN { 3*0Sstevel@tonic-gate if (ord("A") == 193) { 4*0Sstevel@tonic-gate die "Encode::KR not supported on EBCDIC\n"; 5*0Sstevel@tonic-gate } 6*0Sstevel@tonic-gate} 7*0Sstevel@tonic-gateour $VERSION = do { my @r = (q$Revision: 1.23 $ =~ /\d+/g); sprintf "%d."."%02d" x $#r, @r }; 8*0Sstevel@tonic-gate 9*0Sstevel@tonic-gateuse Encode; 10*0Sstevel@tonic-gateuse XSLoader; 11*0Sstevel@tonic-gateXSLoader::load(__PACKAGE__,$VERSION); 12*0Sstevel@tonic-gate 13*0Sstevel@tonic-gateuse Encode::KR::2022_KR; 14*0Sstevel@tonic-gate 15*0Sstevel@tonic-gate1; 16*0Sstevel@tonic-gate__END__ 17*0Sstevel@tonic-gate 18*0Sstevel@tonic-gate=head1 NAME 19*0Sstevel@tonic-gate 20*0Sstevel@tonic-gateEncode::KR - Korean Encodings 21*0Sstevel@tonic-gate 22*0Sstevel@tonic-gate=head1 SYNOPSIS 23*0Sstevel@tonic-gate 24*0Sstevel@tonic-gate use Encode qw/encode decode/; 25*0Sstevel@tonic-gate $euc_kr = encode("euc-kr", $utf8); # loads Encode::KR implicitly 26*0Sstevel@tonic-gate $utf8 = decode("euc-kr", $euc_kr); # ditto 27*0Sstevel@tonic-gate 28*0Sstevel@tonic-gate=head1 DESCRIPTION 29*0Sstevel@tonic-gate 30*0Sstevel@tonic-gateThis module implements Korean charset encodings. Encodings supported 31*0Sstevel@tonic-gateare as follows. 32*0Sstevel@tonic-gate 33*0Sstevel@tonic-gate 34*0Sstevel@tonic-gate Canonical Alias Description 35*0Sstevel@tonic-gate -------------------------------------------------------------------- 36*0Sstevel@tonic-gate euc-kr /\beuc.*kr$/i EUC (Extended Unix Character) 37*0Sstevel@tonic-gate /\bkr.*euc$/i 38*0Sstevel@tonic-gate ksc5601-raw Korean standard code set (as is) 39*0Sstevel@tonic-gate cp949 /(?:x-)?uhc$/i 40*0Sstevel@tonic-gate /(?:x-)?windows-949$/i 41*0Sstevel@tonic-gate /\bks_c_5601-1987$/i 42*0Sstevel@tonic-gate Code Page 949 (EUC-KR + 8,822 43*0Sstevel@tonic-gate (additional Hangul syllables) 44*0Sstevel@tonic-gate MacKorean EUC-KR + Apple Vendor Mappings 45*0Sstevel@tonic-gate johab JOHAB A supplementary encoding defined in 46*0Sstevel@tonic-gate Annex 3 of KS X 1001:1998 47*0Sstevel@tonic-gate iso-2022-kr iso-2022-kr [RFC1557] 48*0Sstevel@tonic-gate -------------------------------------------------------------------- 49*0Sstevel@tonic-gate 50*0Sstevel@tonic-gateTo find how to use this module in detail, see L<Encode>. 51*0Sstevel@tonic-gate 52*0Sstevel@tonic-gate=head1 BUGS 53*0Sstevel@tonic-gate 54*0Sstevel@tonic-gateWhen you see C<charset=ks_c_5601-1987> on mails and web pages, they really 55*0Sstevel@tonic-gatemean "cp949" encodings. To fix that, the following aliases are set; 56*0Sstevel@tonic-gate 57*0Sstevel@tonic-gate qr/(?:x-)?uhc$/i => '"cp949"' 58*0Sstevel@tonic-gate qr/(?:x-)?windows-949$/i => '"cp949"' 59*0Sstevel@tonic-gate qr/ks_c_5601-1987$/i => '"cp949"' 60*0Sstevel@tonic-gate 61*0Sstevel@tonic-gateThe ASCII region (0x00-0x7f) is preserved for all encodings, even 62*0Sstevel@tonic-gatethough this conflicts with mappings by the Unicode Consortium. See 63*0Sstevel@tonic-gate 64*0Sstevel@tonic-gateL<http://www.debian.or.jp/~kubota/unicode-symbols.html.en> 65*0Sstevel@tonic-gate 66*0Sstevel@tonic-gateto find out why it is implemented that way. 67*0Sstevel@tonic-gate 68*0Sstevel@tonic-gate=head1 SEE ALSO 69*0Sstevel@tonic-gate 70*0Sstevel@tonic-gateL<Encode> 71*0Sstevel@tonic-gate 72*0Sstevel@tonic-gate=cut 73