-
-
13 Oct 2021 08:39:09 UTC
- Distribution: Encode
- Module version: 2.05
- Source (raw)
- Browse (raw)
- Changes
- How to Contribute
- Repository
- Issues (49)
- Testers (254 / 0 / 0)
- Kwalitee
Bus factor: 5- % Coverage
- License: perl_5
- Activity
24 month- Tools
- Download (1.96MB)
- MetaCPAN Explorer
- Permissions
- Subscribe to distribution
- Permalinks
- This version
- Latest version
and 83 contributors-
Dan Kogai
-
Alex Davies
-
Alex Kapranoff
-
Alex Vandiver
-
Andreas J. Koenig
-
Andrew Pennebaker
-
Andy Grundman
-
Anton Tagunov
-
Autrijus Tang
-
Benjamin Goldberg
-
Bjoern Hoehrmann
-
Bjoern Jacke
-
bulk88
-
Craig A. Berry
-
Curtis Jewell
-
Dave Evans
-
David Golden
-
David Steinbrunner
-
Deng Liu
-
Dominic Dunlop
- drry
-
Elizabeth Mattijsen
-
Flavio Poletti
-
Gerrit P. Haase
-
Gisle Aas
-
Graham Barr
-
Graham Knop
-
Graham Ollis
-
Gurusamy Sarathy
-
H.Merijn Brand
-
Hugo van der Sanden
-
chansen
-
Chris Nandor
-
Inaba Hiroto
-
Jarkko Hietaniemi
-
Jesse Vincent
-
Jungshik Shin
-
Karen Etheridge
-
Karl Williamson
-
Kenichi Ishigaki
-
KONNO Hiroharu
-
Laszlo Molnar
-
Makamaka
-
Mark-Jason Dominus
-
Masahiro Iuchi
-
MATSUNO Tokuhiro
-
Mattia Barbon
-
Michael G Schwern
-
Michael LaGrasta
-
Miron Cuperman
-
Moritz Lenz
-
MORIYAMA Masayuki
-
Nick Ing-Simmons
-
Nicholas Clark
-
Olivier Mengué
- otsune
-
Pali
-
Paul Marquess
-
Peter Prymmer
-
Peter Rabbitson
-
Philip Newton
-
Piotr Fusik
-
Rafael Garcia-Suarez
-
Randy Stauner
-
Reini Urban
-
Robin Barker
-
SADAHIRO Tomoyuki
-
Simon Cozens
-
Slaven Rezic
-
Spider Boardman
-
Steve Hay
-
Steve Peters
-
SUGAWARA Hajime
-
SUZUKI Norio
-
szr8
-
Tatsuhiko Miyagawa
-
Tels
-
Tony Cook
-
Vadim Konovalov
-
Victor
-
Ville Skyttä
-
Vincent van Dam
-
Yitzchak Scott-Thoennes
- Dependencies
- Exporter
- Storable
- parent
- and possibly others
- Reverse dependencies
- CPAN Testers List
- Dependency graph
NAME
Encode::JP - Japanese Encodings
SYNOPSIS
use Encode qw/encode decode/; $euc_jp = encode("euc-jp", $utf8); # loads Encode::JP implicitly $utf8 = decode("euc-jp", $euc_jp); # ditto
ABSTRACT
This module implements Japanese charset encodings. Encodings supported are as follows.
Canonical Alias Description -------------------------------------------------------------------- euc-jp /\beuc.*jp$/i EUC (Extended Unix Character) /\bjp.*euc/i /\bujis$/i shiftjis /\bshift.*jis$/i Shift JIS (aka MS Kanji) /\bsjis$/i 7bit-jis /\bjis$/i 7bit JIS iso-2022-jp ISO-2022-JP [RFC1468] = 7bit JIS with all Halfwidth Kana converted to Fullwidth iso-2022-jp-1 ISO-2022-JP-1 [RFC2237] = ISO-2022-JP with JIS X 0212-1990 support. See below MacJapanese Shift JIS + Apple vendor mappings cp932 /\bwindows-31j$/i Code Page 932 = Shift JIS + MS/IBM vendor mappings jis0201-raw JIS0201, raw format jis0208-raw JIS0208, raw format jis0212-raw JIS0212, raw format --------------------------------------------------------------------
DESCRIPTION
To find out how to use this module in detail, see Encode.
Note on ISO-2022-JP(-1)?
ISO-2022-JP-1 (RFC2237) is a superset of ISO-2022-JP (RFC1468) which adds support for JIS X 0212-1990. That means you can use the same code to decode to utf8 but not vice versa.
$utf8 = decode('iso-2022-jp-1', $stream);
and
$utf8 = decode('iso-2022-jp', $stream);
yield the same result but
$with_0212 = encode('iso-2022-jp-1', $utf8);
is now different from
$without_0212 = encode('iso-2022-jp', $utf8 );
In the latter case, characters that map to 0212 are first converted to U+3013 (0xA2AE in EUC-JP; a white square also known as 'Tofu' or 'geta mark') then fed to the decoding engine. U+FFFD is not used, in order to preserve text layout as much as possible.
BUGS
The ASCII region (0x00-0x7f) is preserved for all encodings, even though this conflicts with mappings by the Unicode Consortium.
SEE ALSO
Module Install Instructions
To install Encode, copy and paste the appropriate command in to your terminal.
cpanm Encode
perl -MCPAN -e shell install Encode
For more information on module installation, please visit the detailed CPAN module installation guide.