H.3 Example of Person Name Value Representation in the Japanese Language

Character strings representing person names are encoded using a convention for PN value representations based on component groups with 5 components.

For languages that use ideographic characters, it is sometimes necessary to write names both in ideographic characters and in phonetic characters. Ideographic characters may be required for official purposes, while phonetic characters may be needed for pronunciation and data processing purposes.

For the purpose of writing names in ideographic characters and in phonetic characters, up to 3 component groups may be used. The delimiter of the component group shall be the equals character “=” (3DH). The three component groups in their order of occurrence are: an alphabetic representation, an ideographic representation, and a phonetic representation.

H.3.1 Example 1: Value 1 of Attribute Specific Character Set (0008,0005) is not present.

In this case, ISO-IR 6 is used by default.

(0008,0005) \ISO 2022 IR 87

Character String:

[pic]

Encoded representation:

05/09 06/01 06/13 06/01 06/04 06/01 5/14 05/04 06/01 07/02 06/15 07/05 03/13 01/11 02/04 04/02 03/11 03/03 04/05 04/04 01/11 02/08 04/02 05/14 01/11 02/04 04/02 04/02 04/00 04/15 03/10 01/11 02/08 04/02 03/13 01/11 02/04 04/02 02/04 06/04 02/04 05/14 02/04 04/00 01/11 02/08 04/02 05/14 01/11 02/04 04/02 02/04 03/15 02/04 06/13 02/04 02/06 01/11 02/08 04/02

An example of what might be displayed or printed by an ASCII based machine that displays or prints the Control Character ESC (01/11) using \033:

Yamada^Tarou=\033$B;3ED\033(B^\033$BB@O:\033(B=\033$B$d$^$@\033(B^\033$B$?$m$&\033(B

Table H-1CHARACTER SETS AND ESCAPE SEQUENCES USED IN EXAMPLE 1

Character Set Description Component Group Value of (0008,0005) Defined Term ISO Registration Number Standard for Code Extension ESC Sequence Character Set: Purpose of Use
Japanese First: Single-byte Value 1: none ISO-IR 6 GL ISO 646:
Second: Ideographic Value 2: ISO 2022 IR 87 ISO-IR 87 ISO 2022 ESC 02/04 04/02 GL JIS X 0208: Japanese kanji, hiragana, katakana
Value 1: none ISO-IR 6 ISO 2022 ESC 02/08 04/02 GL ISO 646: for delimiters
Third: Phonetic Value 2: ISO 2022 IR 87 ISO-IR 87 ISO 2022 ESC 02/04 04/02 GL JIS X 0208: Japanese hiragana, and katakana
Value 1: none ISO-IR 6 ISO 2022 ESC 02/08 04/02 GL ISO 646: for delimiters

H.3.2 Example 2: Value 1 of Attribute Specific Character Set (0008,0005) is ISO 2022 IR 13.

(0008,0005) ISO 2022 IR 13\ISO 2022 IR 87

Character String:

[pic]

Encoded representation:

13/04 12/15 12/00 13/14 05/14 12/00 13/11 11/03 03/13 01/11 02/04 04/02 03/11 03/03 04/05 04/04 01/11 02/08 04/10 05/14 01/11 02/04 04/02 04/02 04/00 04/15 03/10 01/11 02/08 04/10 03/13 01/11 02/04 04/02 02/04 06/04 02/04 05/14 02/04 04/00 01/11 02/08 04/10 05/14 01/11 02/04 04/02 02/04 03/15 02/04 06/13 02/04 02/06 01/11 02/08 04/10

An example of what might be displayed or printed by an ASCII based machine that displays or prints the Control Character ESC (01/11) using \033:

\324\3l7\300\336^\300\333\263=\033$B;3ED\033(J^\033$BB@O:\033(J=\033$B$d$^$@\033(J^\033$B$?$m$&\033(J

Table H-2CHARACTER SETS AND ESCAPE SEQUENCES USED IN EXAMPLE 2

Character Set Description Component Group Value of (0008,0005) Defined Term ISO Registration Number Standard for Code Extension ESC Sequence Character Set: Purpose of Use
Japanese First: Single-byte Value 1: ISO 2022 IR 13 ISO-IR 13 ISO 2022 ESC 02/09 04/09 GR JIS X 0201: Japanese katakana
ISO-IR 14 ISO 2022 ESC 02/08 04/10 GL JIS X 0201: Japanese romaji for delimiters
Second: Ideographic Value 2: ISO 2022 IR 87 ISO-IR 87 ISO 2022 ESC 02/04 04/02 GL JIS X 0208: Japanese kanji, hiragana, katakana
Value 1: ISO 2022 IR 13 ISO-IR 14 ISO 2022 ESC 02/08 04/10 GL JIS X 0201: Japanese romaji for delimiters
Third: Phonetic Value 2: ISO 2022 IR 87 ISO-IR 87 ISO 2022 ESC 02/04 04/02 GL JIS X 0208: Japanese hiragana, and katakana
Value 1: ISO 2022 IR 13 ISO-IR 14 ISO 2022 ESC 02/08 04/10 GL JIS X 0201: Japanese romaji for delimiters