首页 > HTML基础教程 > HTML 字符集
此章节介绍:

    HTML 字符集

    为了正确显示 HTML 页面,Web 浏览器必须知道要使用哪个字符集。


    从 ASCII 到 UTF-8

    ASCII 是第一个字符编码标准。ASCII 定义了 128 种可以在互联网上使用的字符:数字(0-9)、英文字母(A-Z)和一些特殊字符,比如:! $ + - ( ) @ < >。

    ISO-8859-1 是 HTML 4 的默认字符集。此字符集支持 256 个不同的字符代码。HTML 4 同时支持 UTF-8。

    ANSI(Windows-1252)是原始的 Windows 字符集。 ANSI 与 ISO-8859-1 相同,不同之处在于 ANSI 具有 32 个额外的字符。

    HTML5 规范鼓励 Web 开发人员使用 UTF-8 字符集,该字符集涵盖了世界上几乎所有的字符和符号!


    HTML charset 属性

    为了正确显示 HTML 页面,Web 浏览器必须了解页面中使用的字符集。

    这在 <meta> 标签中指定:

    <meta charset="UTF-8">



    字符集之间的差异

    下表展示了上述字符集之间的区别:

    NumbASCIIANSI8859UTF-8Description
    32



    space
    33!!!!exclamation mark
    34""""quotation mark
    35####number sign
    36$$$$dollar sign
    37%%%%percent sign
    38&&&&ampersand
    39''''apostrophe
    40((((left parenthesis
    41))))right parenthesis
    42****asterisk
    43++++plus sign
    44,,,,comma
    45----hyphen-minus
    46....full stop
    47////solidus
    480000digit zero
    491111digit one
    502222digit two
    513333digit three
    524444digit four
    535555digit five
    546666digit six
    557777digit seven
    568888digit eight
    579999digit nine
    58::::colon
    59;;;;semicolon
    60<<<<less-than sign
    61====equals sign
    62>>>>greater-than sign
    63????question mark
    64@@@@commercial at
    65AAAALatin capital letter A
    66BBBBLatin capital letter B
    67CCCCLatin capital letter C
    68DDDDLatin capital letter D
    69EEEELatin capital letter E
    70FFFFLatin capital letter F
    71GGGGLatin capital letter G
    72HHHHLatin capital letter H
    73IIIILatin capital letter I
    74JJJJLatin capital letter J
    75KKKKLatin capital letter K
    76LLLLLatin capital letter L
    77MMMMLatin capital letter M
    78NNNNLatin capital letter N
    79OOOOLatin capital letter O
    80PPPPLatin capital letter P
    81QQQQLatin capital letter Q
    82RRRRLatin capital letter R
    83SSSSLatin capital letter S
    84TTTTLatin capital letter T
    85UUUULatin capital letter U
    86VVVVLatin capital letter V
    87WWWWLatin capital letter W
    88XXXXLatin capital letter X
    89YYYYLatin capital letter Y
    90ZZZZLatin capital letter Z
    91[[[[left square bracket
    92\\\\reverse solidus
    93]]]]right square bracket
    94^^^^circumflex accent
    95____low line
    96````grave accent
    97aaaaLatin small letter a
    98bbbbLatin small letter b
    99ccccLatin small letter c
    100ddddLatin small letter d
    101eeeeLatin small letter e
    102ffffLatin small letter f
    103ggggLatin small letter g
    104hhhhLatin small letter h
    105iiiiLatin small letter i
    106jjjjLatin small letter j
    107kkkkLatin small letter k
    108llllLatin small letter l
    109mmmmLatin small letter m
    110nnnnLatin small letter n
    111ooooLatin small letter o
    112ppppLatin small letter p
    113qqqqLatin small letter q
    114rrrrLatin small letter r
    115ssssLatin small letter s
    116ttttLatin small letter t
    117uuuuLatin small letter u
    118vvvvLatin small letter v
    119wwwwLatin small letter w
    120xxxxLatin small letter x
    121yyyyLatin small letter y
    122zzzzLatin small letter z
    123{{{{left curly bracket
    124||||vertical line
    125}}}}right curly bracket
    126~~~~tilde
    127DEL    
    128   euro sign
    129 NOT USED
    130   single low-9 quotation mark
    131 ƒ  Latin small letter f with hook
    132   double low-9 quotation mark
    133   horizontal ellipsis
    134   dagger
    135   double dagger
    136 ˆ  modifier letter circumflex accent
    137   per mille sign
    138 Š  Latin capital letter S with caron
    139   single left-pointing angle quotation mark
    140 Œ  Latin capital ligature OE
    141 NOT USED
    142 Ž  Latin capital letter Z with caron
    143 NOT USED
    144 NOT USED
    145   left single quotation mark
    146   right single quotation mark
    147   left double quotation mark
    148   right double quotation mark
    149   bullet
    150   en dash
    151   em dash
    152 ˜  small tilde
    153   trade mark sign
    154 š  Latin small letter s with caron
    155   single right-pointing angle quotation mark
    156 œ  Latin small ligature oe
    157 NOT USED
    158 ž  Latin small letter z with caron
    159 Ÿ  Latin capital letter Y with diaeresis
    160    no-break space
    161 ¡¡¡inverted exclamation mark
    162 ¢¢¢cent sign
    163 £££pound sign
    164 ¤¤¤currency sign
    165 ¥¥¥yen sign
    166 ¦¦¦broken bar
    167 §§§section sign
    168 ¨¨¨diaeresis
    169 ©©©copyright sign
    170 ªªªfeminine ordinal indicator
    171 «««left-pointing double angle quotation mark
    172 ¬¬¬not sign
    173 ­­­soft hyphen
    174 ®®®registered sign
    175 ¯¯¯macron
    176 °°°degree sign
    177 ±±±plus-minus sign
    178 ²²²superscript two
    179 ³³³superscript three
    180 ´´´acute accent
    181 µµµmicro sign
    182 pilcrow sign
    183 ···middle dot
    184 ¸¸¸cedilla
    185 ¹¹¹superscript one
    186 ºººmasculine ordinal indicator
    187 »»»right-pointing double angle quotation mark
    188 ¼¼¼vulgar fraction one quarter
    189 ½½½vulgar fraction one half
    190 ¾¾¾vulgar fraction three quarters
    191 ¿¿¿inverted question mark
    192 ÀÀÀLatin capital letter A with grave
    193 ÁÁÁLatin capital letter A with acute
    194 ÂÂÂLatin capital letter A with circumflex
    195 ÃÃÃLatin capital letter A with tilde
    196 ÄÄÄLatin capital letter A with diaeresis
    197 ÅÅÅLatin capital letter A with ring above
    198 ÆÆÆLatin capital letter AE
    199 ÇÇÇLatin capital letter C with cedilla
    200 ÈÈÈLatin capital letter E with grave
    201 ÉÉÉLatin capital letter E with acute
    202 ÊÊÊLatin capital letter E with circumflex
    203 ËËËLatin capital letter E with diaeresis
    204 ÌÌÌLatin capital letter I with grave
    205 ÍÍÍLatin capital letter I with acute
    206 ÎÎÎLatin capital letter I with circumflex
    207 ÏÏÏLatin capital letter I with diaeresis
    208 ÐÐÐLatin capital letter Eth
    209 ÑÑÑLatin capital letter N with tilde
    210 ÒÒÒLatin capital letter O with grave
    211 ÓÓÓLatin capital letter O with acute
    212 ÔÔÔLatin capital letter O with circumflex
    213 ÕÕÕLatin capital letter O with tilde
    214 ÖÖÖLatin capital letter O with diaeresis
    215 ×××multiplication sign
    216 ØØØLatin capital letter O with stroke
    217 ÙÙÙLatin capital letter U with grave
    218 ÚÚÚLatin capital letter U with acute
    219 ÛÛÛLatin capital letter U with circumflex
    220 ÜÜÜLatin capital letter U with diaeresis
    221 ÝÝÝLatin capital letter Y with acute
    222 ÞÞÞLatin capital letter Thorn
    223 ßßßLatin small letter sharp s
    224 àààLatin small letter a with grave
    225 áááLatin small letter a with acute
    226 âââLatin small letter a with circumflex
    227 ãããLatin small letter a with tilde
    228 äääLatin small letter a with diaeresis
    229 åååLatin small letter a with ring above
    230 æææLatin small letter ae
    231 çççLatin small letter c with cedilla
    232 èèèLatin small letter e with grave
    233 éééLatin small letter e with acute
    234 êêêLatin small letter e with circumflex
    235 ëëëLatin small letter e with diaeresis
    236 ìììLatin small letter i with grave
    237 íííLatin small letter i with acute
    238 îîîLatin small letter i with circumflex
    239 ïïïLatin small letter i with diaeresis
    240 ðððLatin small letter eth
    241 ñññLatin small letter n with tilde
    242 òòòLatin small letter o with grave
    243 óóóLatin small letter o with acute
    244 ôôôLatin small letter o with circumflex
    245 õõõLatin small letter o with tilde
    246 öööLatin small letter o with diaeresis
    247 ÷÷÷division sign
    248 øøøLatin small letter o with stroke
    249 ùùùLatin small letter u with grave
    250 úúúLatin small letter u with acute
    251 ûûûLatin small letter with circumflex
    252 üüüLatin small letter u with diaeresis
    253 ýýýLatin small letter y with acute
    254 þþþLatin small letter thorn
    255 ÿÿÿLatin small letter y with diaeresis

    ASCII 字符集

    ASCII 使用 0 到 31(以及 127)之间的值作为控制字符。

    ASCII 使用 32 到 126 的值表示字母、数字和符号。

    ASCII 不使用 128 到 255 之间的值。


    ANSI 字符集 (Windows-1252)

    对于 0 到 127 的值,ANSI 与 ASCII 相同。

    ANSI 有一组专有的字符,其值从 128 到 159。

    对于 160 到 255 的值,ANSI 与 UTF-8 相同。


    ISO-8859-1 字符集

    对于 0 到 127 的值,8859-1 与 ASCII 相同。

    8859-1 不使用 128 到 159 之间的值。

    对于从 160 到 255 的值,8859-1 与 UTF-8 相同。


    UTF-8 字符集

    对于 0 到 127 的值,UTF-8 与 ASCII 相同。

    UTF-8 不使用 12 8到 159 之间的值。

    对于 160 到 255 之间的值,UTF-8 与 ANSI 和 8859-1 相同。

    UTF-8 从值 256 继续,包含超过 10000 个不同字符。

    如需深入研究,请阅读我们完整的 HTML 字符集参考手册。


    @charset CSS 规则

    您可以使用 CSS @charset 规则来指定样式表中使用的字符编码:

    实例

    将样式表的编码设置为 Unicode UTF-8:

    @charset "UTF-8";

    请在我们的 CSS 参考手册中获取更多有关 CSS @charset 规则 的内容。