Convert HTML entities to their corresponding characters

˵��

html_entity_decode ( string $string [, int $flags = ENT_COMPAT | ENT_HTML401 [, string $encoding = ini_get("default_charset") ]] ) : string

html_entity_decode() is the opposite of htmlentities() in that it converts HTML entities in the string to their corresponding characters.

More precisely, this function decodes all the entities (including all numeric entities) that a) are necessarily valid for the chosen document type — i.e., for XML, this function does not decode named entities that might be defined in some DTD — and b) whose character or characters are in the coded character set associated with the chosen encoding and are permitted in the chosen document type. All other entities are left as is.

��

string

The input string.

flags

A bitmask of one or more of the following flags, which specify how to handle quotes and which document type to use. The default is ENT_COMPAT | ENT_HTML401.

**Available `flags` constants**
Constant Name	Description
`ENT_COMPAT`	Will convert double-quotes and leave single-quotes alone.
`ENT_QUOTES`	Will convert both double and single quotes.
`ENT_NOQUOTES`	Will leave both double and single quotes unconverted.
`ENT_HTML401`	Handle code as HTML 4.01.
`ENT_XML1`	Handle code as XML 1.
`ENT_XHTML`	Handle code as XHTML.
`ENT_HTML5`	Handle code as HTML 5.

encoding

An optional argument defining the encoding used when converting characters.

If omitted, the default value of the encoding varies depending on the PHP version in use. In PHP 5.6 and later, the default_charset configuration option is used as the default value. PHP 5.4 and 5.5 will use UTF-8 as the default. Earlier versions of PHP use ISO-8859-1.

Although this argument is technically optional, you are highly encouraged to specify the correct value for your code if you are using PHP 5.5 or earlier, or if your default_charset configuration option may be set incorrectly for the given input.

֧��ַ��

**֧�ֵ��ַ��б�**
�ַ��	��	��
ISO-8859-1	ISO8859-1	��ŷ��Latin-1
ISO-8859-5	ISO8859-5	Little used cyrillic charset (Latin/Cyrillic).
ISO-8859-15	ISO8859-15	��ŷ��Latin-9��ŷԪ��ţ��ͷ��ĸ�� Latin-1(ISO-8859-1) ��ȱʧ��
UTF-8		ASCII ��ݵĶ��ֽ� 8 λ Unicode��
cp866	ibm866, 866	DOS ��е��롣��ַ�� 4.3.2 �汾�еõ�֧�֡�
cp1251	Windows-1251, win-1251, 1251	Windows ��е��롣��ַ�� 4.3.2 �汾�еõ�֧�֡�
cp1252	Windows-1252, 1252	Windows ��е��ŷ��롣
KOI8-R	koi8-ru, koi8r	����ַ�� 4.3.2 �汾�еõ�֧�֡�
BIG5	950	��ģ��Ҫ��й�̨��ʡ��
GB2312	936	��ģ��й��ұ�׼�ַ��
BIG5-HKSCS		��ģ��չ�� Big5 �ַ��
Shift_JIS	SJIS, 932	��
EUC-JP	EUCJP	��
MacRoman		Mac OS ʹ�õ��ַ��
''		An empty string activates detection from script encoding (Zend multibyte), default_charset and current locale (see nl_langinfo() and setlocale()), in this order. Not recommended.

Note: ��ַ��û��Ͽɡ��ʹ��Ĭ�ϱ��벢�׳��쳣��

��ֵ

Returns the decoded string.

��־

�汾	˵��
5.6.0	The default value for the `encoding` parameter was changed to be the value of the default_charset configuration option.
5.4.0	Default encoding changed from ISO-8859-1 to UTF-8.
5.4.0	The constants `ENT_HTML401`, `ENT_XML1`, `ENT_XHTML` and `ENT_HTML5` were added.

��

Example #1 Decoding HTML entities


<?php
$orig = "I'll \"walk\" the <b>dog</b> now";

$a = htmlentities($orig);

$b = html_entity_decode($a);

echo $a; // I'll &quot;walk&quot; the &lt;b&gt;dog&lt;/b&gt; now

echo $b; // I'll "walk" the <b>dog</b> now
?>

ע��

Note:
You might wonder why trim(html_entity_decode(' ')); doesn't reduce the string to an empty string, that's because the ' ' entity is not ASCII code 32 (which is stripped by trim()) but ASCII code 160 (0xa0) in the default ISO 8859-1 encoding.

�μ�

htmlentities() - ��ַ�ת��Ϊ HTML ת��ַ�
htmlspecialchars() - ��ַ�ת��Ϊ HTML ʵ��
get_html_translation_table() - ��ʹ�� htmlspecialchars �� htmlentities ��ת��
urldecode() - ��ѱ�� URL �ַ��

html_entity_decode

˵��

����

����ֵ

������־

����

ע��

�μ�

��

��ֵ

��־

��