HTML CHARSET


To display letters, digits, and other symbols correctly, a browser must know which character encoding a page uses. In HTML this is known as the charset, or HTML encoding.

An HTML charset (short for character set) tells the browser how to turn the bytes of a page into readable text. Each character encoding standard assigns a unique number to every character it supports, and the browser uses those numbers to display the page correctly.
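The idea of one unique number per character can be sketched in a few lines of Python. This is only an illustration; the helper name code_points is hypothetical and not part of HTML or any browser API.

```python
def code_points(text: str) -> list[int]:
    """Return the number (code point) assigned to each character in text."""
    return [ord(ch) for ch in text]


print(code_points("Hi!"))           # [72, 105, 33]
print(chr(72) + chr(105) + chr(33))  # Hi!
```

Here ord() looks up the number for a character and chr() goes the other way, which is roughly what a browser does when it maps decoded numbers back to glyphs.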


HTML CHARACTER ENCODING


Several character encodings have been used with HTML over time. The most common ones are described below:


ASCII CHARACTER SET


The first character encoding standard used in HTML was ASCII, which stands for American Standard Code for Information Interchange. It defines 128 characters: the digits (0-9), the English letters (A-Z and a-z), some special characters such as ! $ + - ( ) @ < >, and a set of non-printing control characters.
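As a rough illustration of the 128-character limit, the following Python sketch checks whether a string fits in ASCII. The function name is_ascii is a hypothetical helper chosen for this example.

```python
def is_ascii(text: str) -> bool:
    """Return True if every character has a code point in the range 0..127."""
    return all(ord(ch) < 128 for ch in text)


print(is_ascii("Hello ! $ + - ( ) @ < >"))  # True
print(is_ascii("café"))                     # False

# Encoding a non-ASCII character with the ASCII codec raises an error.
try:
    "café".encode("ascii")
except UnicodeEncodeError as err:
    print("cannot encode:", err.object[err.start:err.end])
```

Any character outside the English letters, digits, and basic symbols, such as the accented é, falls outside ASCII and needs one of the larger character sets.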


ANSI CHARACTER SET


ANSI, named after the American National Standards Institute, is an extended version of the ASCII character set with 256 code positions. It is also called Windows-1252 and was the default character set for Windows up to Windows 95.
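A minimal sketch of what a 256-position character set means in practice: every supported character is stored in exactly one byte. The example assumes Python's codec name "cp1252" for Windows-1252, and the helper encode_windows_1252 is hypothetical.

```python
def encode_windows_1252(text: str) -> bytes:
    """Encode text as Windows-1252, one byte per character."""
    return text.encode("cp1252")


sample = "Café €5"
encoded = encode_windows_1252(sample)

# Seven characters become seven bytes.
print(len(sample), len(encoded))  # 7 7

# The ASCII characters keep their ASCII values; é and € use values above 127.
print(list(encoded))
```

The first 128 values stay identical to ASCII, which is why plain English text looks the same in both character sets.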


ISO-8859-1 CHARACTER SET


ISO-8859-1, defined by the International Organization for Standardization (ISO), was the default character encoding in HTML 4.0. It is also an extension of the ASCII standard and has 256 code positions. In addition to the digits (0-9), the English letters (A-Z and a-z), and the ASCII special characters, it adds accented letters and symbols used in Western European languages.
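ISO-8859-1 and Windows-1252 agree on most byte values but not all. The sketch below decodes the same bytes with both, assuming Python's codec names "iso-8859-1" and "cp1252"; decode_both is a hypothetical helper.

```python
def decode_both(raw: bytes) -> tuple[str, str]:
    """Decode raw as ISO-8859-1 and as Windows-1252, in that order."""
    return raw.decode("iso-8859-1"), raw.decode("cp1252")


# Byte 0xE9 is the letter é in both character sets.
print(decode_both(b"\xe9"))

# Byte 0x80 is an invisible control character in ISO-8859-1
# but the euro sign in Windows-1252.
iso_text, win_text = decode_both(b"\x80")
print(repr(iso_text), repr(win_text))
```

This is one reason a page can show odd symbols when the declared charset does not match the encoding the file was actually saved in.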


UTF-8 CHARACTER SET


UTF-8 and UTF-16 are both encodings of Unicode, the standard developed by the Unicode Consortium. UTF-8 can represent every Unicode character, so it covers almost all the characters, symbols, and punctuation used in the world's writing systems.
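UTF-8 reaches that coverage by using a variable number of bytes per character. The following Python sketch shows this; the helper name utf8_length is hypothetical.

```python
def utf8_length(ch: str) -> int:
    """Return how many bytes UTF-8 needs to store the given text."""
    return len(ch.encode("utf-8"))


for ch in ["A", "é", "€", "😀"]:
    # Prints the character, its byte count, and the bytes in hexadecimal.
    print(ch, utf8_length(ch), ch.encode("utf-8").hex())
```

ASCII characters still take a single byte, so plain English text is byte-for-byte the same in ASCII and UTF-8, while other characters take two to four bytes.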


THE HTML CHARSET ATTRIBUTE


The charset used on a page is specified with the charset attribute of the <meta> tag, which is placed inside the <head> element.



<meta charset="UTF-8">
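To see how a program might read this declaration, here is a minimal Python sketch built on the standard library's html.parser module. The class CharsetFinder and the function find_charset are hypothetical names for this example, and real browsers follow a more detailed detection procedure than this.

```python
from html.parser import HTMLParser
from typing import Optional


class CharsetFinder(HTMLParser):
    """Remember the value of the first <meta charset="..."> tag seen."""

    def __init__(self) -> None:
        super().__init__()
        self.charset: Optional[str] = None

    def handle_starttag(self, tag, attrs):
        # HTMLParser passes tag and attribute names in lowercase.
        if tag == "meta" and self.charset is None:
            for name, value in attrs:
                if name == "charset" and value:
                    self.charset = value.strip().lower()


def find_charset(html: str) -> Optional[str]:
    """Return the declared charset in lowercase, or None if there is none."""
    finder = CharsetFinder()
    finder.feed(html)
    return finder.charset


page = '<html><head><meta charset="UTF-8"></head><body>Hi</body></html>'
print(find_charset(page))  # utf-8
```

The sketch only handles the short <meta charset="..."> form shown above and returns None when no such tag is present.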