html_entity_decode

(PHP 4 >= 4.3.0, PHP 5, PHP 7)

html_entity_decodeConvert HTML entities to their corresponding characters

˵��

html_entity_decode ( string $string [, int $flags = ENT_COMPAT | ENT_HTML401 [, string $encoding = ini_get("default_charset") ]] ) : string

html_entity_decode() is the opposite of htmlentities() in that it converts HTML entities in the string to their corresponding characters.

More precisely, this function decodes all the entities (including all numeric entities) that a) are necessarily valid for the chosen document type — i.e., for XML, this function does not decode named entities that might be defined in some DTD — and b) whose character or characters are in the coded character set associated with the chosen encoding and are permitted in the chosen document type. All other entities are left as is.

����

string

The input string.

flags

A bitmask of one or more of the following flags, which specify how to handle quotes and which document type to use. The default is ENT_COMPAT | ENT_HTML401.

Available flags constants
Constant Name Description
ENT_COMPAT Will convert double-quotes and leave single-quotes alone.
ENT_QUOTES Will convert both double and single quotes.
ENT_NOQUOTES Will leave both double and single quotes unconverted.
ENT_HTML401 Handle code as HTML 4.01.
ENT_XML1 Handle code as XML 1.
ENT_XHTML Handle code as XHTML.
ENT_HTML5 Handle code as HTML 5.

encoding

An optional argument defining the encoding used when converting characters.

If omitted, the default value of the encoding varies depending on the PHP version in use. In PHP 5.6 and later, the default_charset configuration option is used as the default value. PHP 5.4 and 5.5 will use UTF-8 as the default. Earlier versions of PHP use ISO-8859-1.

Although this argument is technically optional, you are highly encouraged to specify the correct value for your code if you are using PHP 5.5 or earlier, or if your default_charset configuration option may be set incorrectly for the given input.

֧�������ַ�����

֧�ֵ��ַ����б�
�ַ��� ���� ����
ISO-8859-1 ISO8859-1 ��ŷ��Latin-1
ISO-8859-5 ISO8859-5 Little used cyrillic charset (Latin/Cyrillic).
ISO-8859-15 ISO8859-15 ��ŷ��Latin-9������ŷԪ���ţ�����ͷ�������ĸ�� Latin-1(ISO-8859-1) ��ȱʧ��
UTF-8   ASCII ���ݵĶ��ֽ� 8 λ Unicode��
cp866 ibm866, 866 DOS ���е���������롣���ַ����� 4.3.2 �汾�еõ�֧�֡�
cp1251 Windows-1251, win-1251, 1251 Windows ���е���������롣���ַ����� 4.3.2 �汾�еõ�֧�֡�
cp1252 Windows-1252, 1252 Windows ���е���ŷ���롣
KOI8-R koi8-ru, koi8r ������ַ����� 4.3.2 �汾�еõ�֧�֡�
BIG5 950 �������ģ���Ҫ�����й�̨��ʡ��
GB2312 936 �������ģ��й����ұ�׼�ַ�����
BIG5-HKSCS   �������ģ����������չ�� Big5 �ַ�����
Shift_JIS SJIS, 932 ����
EUC-JP EUCJP ����
MacRoman   Mac OS ʹ�õ��ַ�����
''   An empty string activates detection from script encoding (Zend multibyte), default_charset and current locale (see nl_langinfo() and setlocale()), in this order. Not recommended.

Note: �����ַ���û���Ͽɡ�����ʹ��Ĭ�ϱ��벢�׳��쳣��

����ֵ

Returns the decoded string.

������־

�汾 ˵��
5.6.0 The default value for the encoding parameter was changed to be the value of the default_charset configuration option.
5.4.0 Default encoding changed from ISO-8859-1 to UTF-8.
5.4.0 The constants ENT_HTML401, ENT_XML1, ENT_XHTML and ENT_HTML5 were added.

����

Example #1 Decoding HTML entities

<?php
$orig 
"I'll \"walk\" the <b>dog</b> now";

$a htmlentities($orig);

$b html_entity_decode($a);

echo 
$a// I'll &quot;walk&quot; the &lt;b&gt;dog&lt;/b&gt; now

echo $b// I'll "walk" the <b>dog</b> now
?>

ע��

Note:

You might wonder why trim(html_entity_decode('&nbsp;')); doesn't reduce the string to an empty string, that's because the '&nbsp;' entity is not ASCII code 32 (which is stripped by trim()) but ASCII code 160 (0xa0) in the default ISO 8859-1 encoding.

�μ�