String::Multibyte::EUC_JP - internally used by String::Multibyte for EUC-JP |
String::Multibyte::EUC_JP - internally used by String::Multibyte for EUC-JP
use String::Multibyte;
$eucjp = String::Multibyte->new('EUC_JP'); $eucjp_length = $eucjp->length($eucjp_string);
String::Multibyte::EUC_JP
is used for manipulation of strings
in EUC-JP family encodings.
Byte range of single-byte characters:
0x00..0x7F
.
First and second byte range of double-byte characters:
0xA1..0xFE
.
Legal byte range for two bytes preceded by SS3:
0xA1..0xFE
.
Character order (invalid code points are excluded):
0x00..0x7F
, 0x8EA1..0x8EFE
, 0xA1A1..0xFEFE
,
0x8FA1A1..0x8FFEFE
.
EUC-JP EUC-JISX0213 1. 0x00..0x7F C0/G0 ASCII ASCII 2. 0x8EA1..0x8EFE G2 JIS kana JIS kana 3. 0xA1A1..0xFEFE G1 JIS X 0208 JIS X 0213 plane-1 4. 0x8FA1A1..0x8FFEFE G3 JIS X 0212 JIS X 0213 plane-2
C1 controls other than SS2/SS3 (0x80..0x8D
and 0x90..0x9E
)
are not supported.
Unassigned G2 (0x8EE0..0x8EFE
) are now included.
String::Multibyte::EUC_JP - internally used by String::Multibyte for EUC-JP |