ARIB STD B24 character set

ARIB STB-B24 encoding
StandardARIB STB-B24
ClassificationISO 2022 profile/extension
Transforms / EncodesARIB STB-B24 Kanji, Kana and mosaic sets,
JIS X 0201
ARIB STB-B24 Kanji set
ARIB Extended Font (Weather Symbols) ja.svg
Weather symbols: a few of the extended symbols included.
Language(s)Japanese, English, Russian
Partial support: Greek, Chinese
StandardARIB STB-B24
ClassificationISO-2022-structured CJK DBCS
ExtendsJIS X 0208
Encoding formats
  • ARIB STB-B24 encoding (ISO 2022 based)
  • Shift JIS (ARIB variant)[1]

The Association of Radio Industries and Businesses (ARIB) STD-B24 standard for Broadcast Markup Language[2] specifies, amongst other details, a character encoding for use in Japanese-language broadcasting. It was introduced on 1999-10-26.[2] The latest revision is version 6.3 as of 2016-07-06.

It includes a number of ARIB extended characters (ARIB外字, ARIB gaiji) not found in the base standards (JIS X 0208 and JIS X 0201). It was the source standard for many symbol characters which were added to Unicode, including portions of the Miscellaneous Symbols, Enclosed Alphanumeric Supplement and Enclosed Ideographic Supplement blocks.[3] Its contributions partially overlap the Unicode emoji, but were added a year earlier, in Unicode 5.2.[4]

The ARIB STD-B62 standard, published in 2014, defines Unicode mappings for a selection of the B24 extended characters (excluding, for example, those duplicated by JIS X 0213), as well as a few extended Kanji.[5] It also includes a mapping of utilised characters outside the Basic Multilingual Plane to the BMP's private use area.

Sets and codes[edit]

The ARIB STD B24 standard defines multiple character sets and a method of switching between them. These include a Kanji set (an extension of JIS X 0208), an Alphanumeric set, a Hiragana set, Katakana sets of two distinct layouts and four mosaic sets.[6] The sets are selected using ISO 2022 mechanisms for 94-sets, using the following codes (proportional sets use the same layout as the corresponding non-proportional ones):[7]

Set Type Code (column/line) Code (hexadecimal) Code (ASCII character) Comments
Kanji 2-byte 4/2 42 B The escape code B used for the ARIB Kanji set[7] is used for the 1983 version of JIS C 6226 (JIS X 0208, of which the ARIB Kanji set is an extension) in ISO-2022-JP.[8][9]
Alphanumeric 1-byte 4/10 4A J JIS_C6220-ro (ISO646-JP, JIS X 0201 Roman set). Similar to ASCII, with two assignments differing. Escape code J matches usage in ISO-2022-JP.[9]
Proportional alphanumeric 1-byte 3/6 36 6
Hiragana 1-byte 3/0 30 0 Hiragana themselves follow the same layout as row 4 of JIS X 0208, but without a lead byte. Also adds several additional assignments for punctuation.
Proportional Hiragana 1-byte 3/7 37 7
Katakana 1-byte 3/1 31 1 Katakana themselves follow the same layout as row 5 of JIS X 0208, but without a lead byte. Also adds several additional assignments for punctuation.
Proportional Katakana 1-byte 3/8 38 8
JIS X 0201 Katakana 1-byte 4/9 49 I JIS_C6220-jp (JIS X 0201 Kana set). Escape code matches usage in ISO-2022-JP-3.
Mosaic A 1-byte 3/2 32 2 Pseudographics
Mosaic B 1-byte 3/3 33 3
Mosaic C 1-byte 3/4 34 4 Non-spacing pseudographics
Mosaic D 1-byte 3/5 35 5

Code charts[edit]

Kanji (double-byte) set[edit]

This is a double-byte character set extending JIS X 0208.

Lead byte[edit]

The encoding bytes correspond to the row or cell number plus 0x20, or 32 in decimal (see below). Hence, the code set starting with 0x21 has a row number of 1, and its cell 1 has a continuation byte of 0x21 (or 33), and so forth. Most of the code corresponds to JIS X 0208, exceptions are shown with a heavy border.

ARIB STD-B24 Kanji (double-byte) set (lead bytes)
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
2_ SP
0020
 
Punct.
LEAD
1-_
Symbol
LEAD
2-_
Alnum.
LEAD
3-_
Hira.
LEAD
4-_
Kata.
LEAD
5-_
Greek
LEAD
6-_
Cyrillic
LEAD
7-_
Box
LEAD
8-_
 
 
9-_
 
 
10-_
 
 
11-_
 
 
12-_
 
 
13-_
 
 
14-_
 
 
15-_
3_ Kanji L1
LEAD
16-_
Kanji L1
LEAD
17-_
Kanji L1
LEAD
18-_
Kanji L1
LEAD
19-_
Kanji L1
LEAD
20-_
Kanji L1
LEAD
21-_
Kanji L1
LEAD
22-_
Kanji L1
LEAD
23-_
Kanji L1
LEAD
24-_
Kanji L1
LEAD
25-_
Kanji L1
LEAD
26-_
Kanji L1
LEAD
27-_
Kanji L1
LEAD
28-_
Kanji L1
LEAD
29-_
Kanji L1
LEAD
30-_
Kanji L1
LEAD
31-_
4_ Kanji L1
LEAD
32-_
Kanji L1
LEAD
33-_
Kanji L1
LEAD
34-_
Kanji L1
LEAD
35-_
Kanji L1
LEAD
36-_
Kanji L1
LEAD
37-_
Kanji L1
LEAD
38-_
Kanji L1
LEAD
39-_
Kanji L1
LEAD
40-_
Kanji L1
LEAD
41-_
Kanji L1
LEAD
42-_
Kanji L1
LEAD
43-_
Kanji L1
LEAD
44-_
Kanji L1
LEAD
45-_
Kanji L1
LEAD
46-_
Kanji L1
LEAD
47-_
5_ Kanji L2
LEAD
48-_
Kanji L2
LEAD
49-_
Kanji L2
LEAD
50-_
Kanji L2
LEAD
51-_
Kanji L2
LEAD
52-_
Kanji L2
LEAD
53-_
Kanji L2
LEAD
54-_
Kanji L2
LEAD
55-_
Kanji L2
LEAD
56-_
Kanji L2
LEAD
57-_
Kanji L2
LEAD
58-_
Kanji L2
LEAD
59-_
Kanji L2
LEAD
60-_
Kanji L2
LEAD
61-_
Kanji L2
LEAD
62-_
Kanji L2
LEAD
63-_
6_ Kanji L2
LEAD
64-_
Kanji L2
LEAD
65-_
Kanji L2
LEAD
66-_
Kanji L2
LEAD
67-_
Kanji L2
LEAD
68-_
Kanji L2
LEAD
69-_
Kanji L2
LEAD
70-_
Kanji L2
LEAD
71-_
Kanji L2
LEAD
72-_
Kanji L2
LEAD
73-_
Kanji L2
LEAD
74-_
Kanji L2
LEAD
75-_
Kanji L2
LEAD
76-_
Kanji L2
LEAD
77-_
Kanji L2
LEAD
78-_
Kanji L2
LEAD
79-_
7_ Kanji L2
LEAD
80-_
Kanji L2
LEAD
81-_
Kanji L2
LEAD
82-_
Kanji L2
LEAD
83-_
Kanji L2
LEAD
84-_
 
 
85-_
 
 
86-_
 
 
87-_
 
 
88-_
 
 
89-_
Traffic
LEAD
90-_
Map
LEAD
91-_
Misc.
LEAD
92-_
Misc.
LEAD
93-_
List
LEAD
94-_
DEL
007F
 

Character sets 0x21-0x74 (row numbers 1-84: punctuation, alphabets, numbers, Kana, Kanji)[edit]

Character set 0x7A (row number 90, traffic symbols)[edit]

Characters 90-45 through 90-63 and 90-66 through 90-84 (shown below with a heavy border) are listed in the B24 standard only in table 7-10 (the list of extension characters), and are also the only characters in rows 90 through 91 which are not transport-related symbols; this is noted in the B24 standard in an endnote to table 7-10.[10] The remainder of the extensions are listed in both table 7-4 (the double-byte code chart) and table 7-10.[10]

ARIB STD-B24 Kanji (double-byte) set (prefixed with 0x7A)[5][11]
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
2_
26CC
90-1

26CD
90-2

2757
90-3

26CF
90-4

26D0
90-5

26D1
90-6

 
90-7

26D2
90-8

26D5
90-9

26D3
90-10

26D4
90-11

 
90-12

 
90-13

 
90-14

 
90-15
3_ 🅿
1F17F
90-16
🆊
1F18A
90-17

 
90-18

 
90-19

26D6
90-20

26D7
90-21

26D8
90-22

26D9
90-23

26DA
90-24

26DB
90-25

26DC
90-26

26DD
90-27

26DE
90-28

26DF
90-29

26E0
90-30

26E1
90-31
4_
2B55
90-32

3248
90-33

3249
90-34

324A
90-35

324B
90-36

324C
90-37

324D
90-38

324E
90-39

324F
90-40

 
90-41

 
90-42

 
90-43

 
90-44

2491
90-45

2492
90-46

2493
90-47
5_ 🅊
1F14A
90-48
🅌
1F14C
90-49
🄿
1F13F
90-50
🅆
1F146
90-51
🅋
1F14B
90-52
🈐
1F210
90-53
🈑
1F211
90-54
🈒
1F212
90-55
🈓
1F213
90-56
🅂
1F142
90-57
🈔
1F214
90-58
🈕
1F215
90-59
🈖
1F216
90-60
🅍
1F14D
90-61
🄱
1F131
90-62
🄽
1F13D
90-63
6_
2B1B
90-64

2B24
90-65
🈗
1F217
90-66
🈘
1F218
90-67
🈙
1F219
90-68
🈚
1F21A
90-69
🈛
1F21B
90-70

26BF
90-71
🈜
1F21C
90-72
🈝
1F21D
90-73
🈞
1F21E
90-74
🈟
1F21F
90-75
🈠
1F220
90-76
🈡
1F221
90-77
🈢
1F222
90-78
🈣
1F223
90-79
7_ 🈤
1F224
90-80
🈥
1F225
90-81
🅎
1F14E
90-82

3299
90-83
🈀
1F200
90-84

 
90-85

 
90-86

 
90-87

 
90-88

 
90-89

 
90-90

 
90-91

 
90-92

 
90-93

 
90-94

Character set 0x7B (row number 91, map symbols)[edit]

Characters from ARIB STD-B24 which were not retained in ARIB STD-B62 are shown shaded.

ARIB STD-B24 Kanji (double-byte) set (prefixed with 0x7B)[5][11][12]
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
2_
26E3
91-1

2B56
91-2

2B57
91-3

2B58
91-4

2B59
91-5

2613
91-6

328B
91-7

3012
91-8

26E8
91-9

3246
91-10

3245
91-11

26E9
91-12
[a]
0FD6
91-13

26EA
91-14

26EB
91-15
3_
26EC
91-16

2668
91-17

26ED
91-18

26EE
91-19

26EF
91-20

2693
91-21

2708
91-22

26F0
91-23

26F1
91-24

26F2
91-25

26F3
91-26

26F4
91-27

26F5
91-28
🅗
1F157
91-29

24B9
91-30

24C8
91-31
4_
26F6
91-32
🅟
1F15F
91-33
🆋
1F18B
91-34
🆍
1F18D
91-35
🆌
1F18C
91-36
🅹
1F179
91-37

26F7
91-38

26F8
91-39

26F9
91-40

26FA
91-41
🅻
1F17B
91-42

260E
91-43

26FB
91-44

26FC
91-45

26FD
91-46

26FE
91-47
5_ 🅼
1F17C
91-48

26FF
91-49

 
91-50

 
91-51

 
91-52

 
91-53

 
91-54

 
91-55

 
91-56

 
91-57

 
91-58

 
91-59

 
91-60

 
91-61

 
91-62

 
91-63
6_
 
91-64

 
91-65

 
91-66

 
91-67

 
91-68

 
91-69

 
91-70

 
91-71

 
91-72

 
91-73

 
91-74

 
91-75

 
91-76

 
91-77

 
91-78

 
91-79
7_
 
91-80

 
91-81

 
91-82

 
91-83

 
91-84

 
91-85

 
91-86

 
91-87

 
91-88

 
91-89

 
91-90

 
91-91

 
91-92

 
91-93

 
91-94

Character set 0x7C (row number 92, units, enclosed forms, list markers, arrows)[edit]

Characters from ARIB STD-B24 which were not retained in ARIB STD-B62 are shown shaded.

ARIB STD-B24 Kanji (double-byte) set (prefixed with 0x7C)[5][11][12]
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
2_
27A1
92-1

2B05
92-2

2B06
92-3

2B07
92-4

2B2F
92-5

2B2E
92-6

5E74
92-7

6708
92-8

65E5
92-9

5186
92-10

33A1
92-11

33A5
92-12

339D
92-13

33A0
92-14

33A4
92-15
3_ 🄀
1F100
92-16

2488
92-17

2489
92-18

248A
92-19

248B
92-20

248C
92-21

248D
92-22

248E
92-23

248F
92-24

2490
92-25
[b]
 
92-26
[b]
 
92-27
[b]
 
92-28
[b]
 
92-29
[b]
 
92-30
[b]
 
92-31
4_ 🄁
1F101
92-32
🄂
1F102
92-33
🄃
1F103
92-34
🄄
1F104
92-35
🄅
1F105
92-36
🄆
1F106
92-37
🄇
1F107
92-38
🄈
1F108
92-39
🄉
1F109
92-40
🄊
1F10A
92-41

3233
92-42

3236
92-43

3232
92-44

3231
92-45

3239
92-46

3244
92-47
5_
25B6
92-48

25C0
92-49

3016
92-50

3017
92-51

27D0
92-52
²
00B2
92-53
³
00B3
92-54
🄭
1F12D
92-55
(vn)[c]
 
92-56
(ob)[c]
 
92-57
(cb)[c]
 
92-58
(ce[c]
 
92-59
mb)[c]
 
92-60
(hp)[c]
 
92-61
(br)[c]
 
92-62
(p)[c]
 
92-63
6_ (s)[c]
 
92-64
(ms)[c]
 
92-65
(t)[c]
 
92-66
(bs)[c]
 
92-67
(b)[c]
 
92-68
(tb)[c]
 
92-69
(tp)[c]
 
92-70
(ds)[c]
 
92-71
(ag)[c]
 
92-72
(eg)[c]
 
92-73
(vo)[c]
 
92-74
(fl)[c]
 
92-75
(ke[c]
 
92-76
y)[c]
 
92-77
(sa[c]
 
92-78
x)[c]
 
92-79
7_ (sy[c]
 
92-80
n)[c]
 
92-81
(or[c]
 
92-82
g)[c]
 
92-83
(pe[c]
 
92-84
r)[c]
 
92-85
🄬
1F12C
92-86
🄫
1F12B
92-87

3247
92-88
🆐
1F190
92-89
🈦
1F226
92-90

213B
92-91

 
92-92

 
92-93

 
92-94

Character set 0x7D (row number 93, game and weather symbols, fractions, units, enclosed forms)[edit]

Characters from ARIB STD-B24 which were not retained in ARIB STD-B62 are shown shaded.

ARIB STD-B24 Kanji (double-byte) set (prefixed with 0x7D)[5][11][12]
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
2_
322A
93-1

322B
93-2

322C
93-3

322D
93-4

322E
93-5

322F
93-6

3230
93-7

3237
93-8

337E
93-9

337D
93-10

337C
93-11

337B
93-12

2116
93-13

2121
93-14

3036
93-15
3_
26BE
93-16
🉀
1F240
93-17
🉁
1F241
93-18
🉂
1F242
93-19
🉃
1F243
93-20
🉄
1F244
93-21
🉅
1F245
93-22
🉆
1F246
93-23
🉇
1F247
93-24
🉈
1F248
93-25
🄪
1F12A
93-26
🈧
1F227
93-27
🈨
1F228
93-28
🈩
1F229
93-29
🈔
1F214
93-30
🈪
1F22A
93-31
4_ 🈫
1F22B
93-32
🈬
1F22C
93-33
🈭
1F22D
93-34
🈮
1F22E
93-35
🈯
1F22F
93-36
🈰
1F230
93-37
🈱
1F231
93-38

2113
93-39

338F
93-40

3390
93-41

33CA
93-42

339E
93-43

33A2
93-44

3371
93-45

 
93-46

 
93-47
5_ ½
00BD
93-48

2189
93-49

2153
93-50

2154
93-51
¼
00BC
93-52
¾
00BE
93-53

2155
93-54

2156
93-55

2157
93-56

2158
93-57

2159
93-58

215A
93-59

2150
93-60

215B
93-61

2151
93-62

2152
93-63
6_
2600
93-64

2601
93-65

2602
93-66

26C4
93-67

2616
93-68

2617
93-69

26C9
93-70

26CA
93-71

2666
93-72

2665
93-73

2663
93-74

2660
93-75

26CB
93-76

2A00
93-77

203C
93-78

2049
93-79
7_
26C5
93-80

2614
93-81

26C6
93-82

2603
93-83

26C7
93-84

26A1
93-85

26C8
93-86

 
93-87

269E
93-88

269F
93-89

266C
93-90

260E
93-91

 
93-92

 
93-93

 
93-94

Character set 0x7E (row number 94, list markers)[edit]

Characters from ARIB STD-B24 which were not retained in ARIB STD-B62 are shown shaded.

ARIB STD-B24 Kanji (double-byte) set (prefixed with 0x7E)[5][11][12]
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
2_
2160
94-1

2161
94-2

2162
94-3

2163
94-4

2164
94-5

2165
94-6

2166
94-7

2167
94-8

2168
94-9

2169
94-10

216A
94-11

216B
94-12

2470
94-13

2471
94-14

2472
94-15
3_
2473
94-16

2474
94-17

2475
94-18

2476
94-19

2477
94-20

2478
94-21

2479
94-22

247A
94-23

247B
94-24

247C
94-25

247D
94-26

247E
94-27

247F
94-28

3251
94-29

3252
94-30

3253
94-31
4_
3254
94-32
🄐
1F110
94-33
🄑
1F111
94-34
🄒
1F112
94-35
🄓
1F113
94-36
🄔
1F114
94-37
🄕
1F115
94-38
🄖
1F116
94-39
🄗
1F117
94-40
🄘
1F118
94-41
🄙
1F119
94-42
🄚
1F11A
94-43
🄛
1F11B
94-44
🄜
1F11C
94-45
🄝
1F11D
94-46
🄞
1F11E
94-47
5_ 🄟
1F11F
94-48
🄠
1F120
94-49
🄡
1F121
94-50
🄢
1F122
94-51
🄣
1F123
94-52
🄤
1F124
94-53
🄥
1F125
94-54
🄦
1F126
94-55
🄧
1F127
94-56
🄨
1F128
94-57
🄩
1F129
94-58

3255
94-59

3256
94-60

3257
94-61

3258
94-62

3259
94-63
6_
325A
94-64

2460
94-65

2461
94-66

2462
94-67

2463
94-68

2464
94-69

2465
94-70

2466
94-71

2467
94-72

2468
94-73

2469
94-74

246A
94-75

246B
94-76

246C
94-77

246D
94-78

246E
94-79
7_
246F
94-80

2776
94-81

2777
94-82

2778
94-83

2779
94-84

277A
94-85

277B
94-86

277C
94-87

277D
94-88

277E
94-89

277F
94-90

24EB
94-91

24EC
94-92

325B
94-93

 
94-94

Single-byte sets[edit]

Alphanumeric set[edit]

Differences from US-ASCII are shown with a heavy border.

ARIB STD-B24 Alphanumeric set[13]
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
2_
32

 
!
0021
"
0022
#
0023
$
0024
%
0025
&
0026
'
0027
(
0028
)
0029
*
002A
+
002B
,
002C
-
002D
.
002E
/
002F
3_
48
0
0030
1
0031
2
0032
3
0033
4
0034
5
0035
6
0036
7
0037
8
0038
9
0039
:
003A
;
003B
<
003C
=
003D
>
003E
?
003F
4_
64
@
0040
A
0041
B
0042
C
0043
D
0044
E
0045
F
0046
G
0047
H
0048
I
0049
J
004A
K
004B
L
004C
M
004D
N
004E
O
004F
5_
80
P
0050
Q
0051
R
0052
S
0053
T
0054
U
0055
V
0056
W
0057
X
0058
Y
0059
Z
005A
[
005B
¥
00A5
]
005D
^
005E
_
005F
6_
96
`
0060
a
0061
b
0062
c
0063
d
0064
e
0065
f
0066
g
0067
h
0068
i
0069
j
006A
k
006B
l
006C
m
006D
n
006E
o
006F
7_
112
p
0070
q
0071
r
0072
s
0073
t
0074
u
0075
v
0076
w
0077
x
0078
y
0079
z
007A
{
007B
|
007C
}
007D

203E

 

Hiragana set[edit]

Character allocations not following row 4 of JIS X 0208 are shown with a heavy border.

ARIB STD-B24 Hiragana set[14]
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
2_
32

 

3041

3042

3043

3044

3045

3046

3047

3048

3049

304A

304B

304C

304D

304E

304F
3_
48

3050

3051

3052

3053

3054

3055

3056

3057

3058

3059

305A

305B

305C

305D

305E

305F
4_
64

3060

3061

3062

3063

3064

3065

3066

3067

3068

3069

306A

306B

306C

306D

306E

306F
5_
80

3070

3071

3072

3073

3074

3075

3076

3077

3078

3079

307A

307B

307C

307D

307E

307F
6_
96

3080

3081

3082

3083

3084

3085

3086

3087

3088

3089

308A

308B

308C

308D

308E

308F
7_
112

3090

3091

3092

3093

 

 

 

309D

309E

30FC

3002

300C

300D

3001

30FB

 

Katakana set[edit]

Character allocations not following row 5 of JIS X 0208 are shown with a heavy border.

ARIB STD-B24 Katakana set[15]
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
2_
32

 

30A1

30A2

30A3

30A4

30A5

30A6

30A7

30A8

30A9

30AA

30AB

30AC

30AD

30AE

30AF
3_
48

30B0

30B1

30B2

30B3

30B4

30B5

30B6

30B7

30B8

30B9

30BA

30BB

30BC

30BD

30BE

30BF
4_
64

30C0

30C1

30C2

30C3

30C4

30C5

30C6

30C7

30C8

30C9

30CA

30CB

30CC

30CD

30CE

30CF
5_
80

30D0

30D1

30D2

30D3

30D4

30D5

30D6

30D7

30D8

30D9

30DA

30DB

30DC

30DD

30DE

30DF
6_
96

30E0

30E1

30E2

30E3

30E4

30E5

30E6

30E7

30E8

30E9

30EA

30EB

30EC

30ED

30EE

30EF
7_
112

30F0

30F1

30F2

30F3

30F4

30F5

30F6

309D

309E

30FC

3002

300C

300D

3001

30FB

 

JIS X 0201 Katakana set[edit]

ARIB STD-B24 JIS X 0201 Katakana set[16]
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
2_

 

FF61

FF62

FF63

FF64

FF65

FF66

FF67

FF68

FF69

FF6A

FF6B

FF6C

FF6D

FF6E

FF6F
3_
48

FF70

FF71

FF72

FF73

FF74

FF75

FF76

FF77

FF78

FF79

FF7A

FF7B

FF7C

FF7D

FF7E
ソ
FF7F
4_
64

FF80

FF81

FF82

FF83

FF84

FF85

FF86

FF87

FF88

FF89

FF8A

FF8B

FF8C

FF8D

FF8E

FF8F
5_
80

FF90

FF91

FF92

FF93

FF94

FF95

FF96

FF97

FF98

FF99

FF9A

FF9B

FF9C

FF9D

FF9E

FF9F
6_
96
7_
112

Mosaic sets[edit]

Shift_JIS variant[edit]

In addition to the modified ISO 2022 encoding, the B24 standard also specifies a Shift JIS encoding following JIS X 0208:1997, but with the addition of the extended characters in the kanji set.[1]

First byte
0 1 2 3 4 5 6 7 8 9 A B C D E F
0
1
2 ! " # $ % & ' ( ) * + , - . /
3 0 1 2 3 4 5 6 7 8 9 : ; < = > ?
4 @ A B C D E F G H I J K L M N O
5 P Q R S T U V W X Y Z [ ¥ ] ^ _
6 ` a b c d e f g h i j k l m n o
7 p q r s t u v w x y z { | }
8
9
A
B ソ
C
D
E
F
Second byte
0 1 2 3 4 5 6 7 8 9 A B C D E F
0
1
2
3
4
5
6
7
8
9
A
B
C
D
E
F
 
Non printable ASCII character
Unaltered ASCII character
Modified ASCII character
Single-byte half-width katakana
First byte of a double-byte character, used by JIS X 0208
First byte of an ARIB extended character
Not used as first byte, unallocated space in JIS X 0208
Not used as first byte
Second byte of a double-byte character whose first half of the JIS sequence was odd
Second byte of a double-byte character whose first half of the JIS sequence was even
Unused as second byte of a double-byte character


See also[edit]

Footnotes[edit]

  1. ^ Glossed as "temple" (i.e. Buddhist temple) in B24 table 7-10 (the list of extension characters).
  2. ^ a b c d e f Small form (70% size per code chart / table 7-10) of a kanji character. Shown here simulated.
  3. ^ a b c d e f g h i j k l m n o p q r s t u v w x y z aa ab ac ad Musical abbreviation (or half thereof) not present in Unicode, simulated here with multiple characters.

References[edit]

  1. ^ a b ARIB (2008), p. 105, part 2, section 7.3
  2. ^ a b ARIB (2008)
  3. ^ Suignard, Michel (2008-03-11). "ISO/IEC JTC1/SC2/WG2 N 3397: Japanese TV Symbols" (PDF).
  4. ^ "Unicode 5.2 Emoji List". Emojipedia.
  5. ^ a b c d e f ARIB (2014), pp. 33–50, part 2, Table 5-2
  6. ^ ARIB (2008), pp. 48-52
  7. ^ a b ARIB (2008), p. 39, part 2, Table 7-3
  8. ^ "ISO-IR-087" (PDF). Information Technology Standards Commission of Japan (IPSJ/ITSCJ).
  9. ^ a b RFC 1468 (IETF)
  10. ^ a b ARIB (2008), p. 72
  11. ^ a b c d e ARIB (2008), pp. 54-72, part 2, Table 7-10
  12. ^ a b c d ARIB (2008), pp. 46-47, part 2, Table 7-4
  13. ^ ARIB (2008), p. 48, part 2, Table 7-5
  14. ^ ARIB (2008), p. 50, part 2, Table 7-7
  15. ^ ARIB (2008), p. 49, part 2, Table 7-6
  16. ^ ARIB (2008), p. 52, part 2, Table 7-9

Further reading[edit]

External links[edit]