What a character set!

What a charset!

Bundesarchiv, Bild 183-58117-0010 / CC-BY-SA 3.0

ISO-8859-1

ISO-8859-2

ISO-8859-6

ISO-8859-8

Photo by Nicholas Lazarine on Unsplash

He always used to refer to this guitar, never as a “Fender guitar” or a “Gibson guitar”; it was always the “goddamn guitar.” —Bruce Springsteen talking about his father

When I was growing up there were two things that were unpopular in my house: one was me, and the other one was my guitar. —Bruce Springsteen

‫שלום‬

ISO-8859-8 (visual order): ED E5 EC F9

ISO-8859-8-I (logical order): F9 EC E5 ED ‫ם ו ל ש‬
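The relationship between the two byte orders can be sketched in JavaScript. The helper `decodeHebrewBytes` is hypothetical and covers only the Hebrew letter range 0xE0 to 0xFA, where ISO-8859-8 maps linearly onto U+05D0 to U+05EA (so final mem, U+05DD, is byte 0xED). It is an illustration, not a full decoder.

```javascript
// Hypothetical helper: maps ISO-8859-8 bytes 0xE0..0xFA to U+05D0..U+05EA.
// Only the Hebrew letter range is covered; any other byte throws.
function decodeHebrewBytes(bytes) {
  return bytes
    .map((b) => {
      if (b < 0xe0 || b > 0xfa) {
        throw new RangeError(`not a Hebrew letter byte: 0x${b.toString(16)}`);
      }
      return String.fromCodePoint(b - 0xe0 + 0x05d0);
    })
    .join("");
}

const logicalBytes = [0xf9, 0xec, 0xe5, 0xed]; // ISO-8859-8-I: reading order
const visualBytes = [0xed, 0xe5, 0xec, 0xf9]; // ISO-8859-8: left-to-right display order

const fromLogical = decodeHebrewBytes(logicalBytes);
// Visual order stores the letters as they appear on screen from left to right,
// so a single right-to-left word is reversed to get back to reading order.
const fromVisual = decodeHebrewBytes([...visualBytes].reverse());

console.log(fromLogical === "\u05E9\u05DC\u05D5\u05DD"); // true
console.log(fromVisual === fromLogical); // true
```

Reversing the whole byte array only works for a single right-to-left word; mixed-direction text needs the Unicode bidirectional algorithm.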

character set ≠ character encoding

Character       a             ä             “
Unicode         U+0061        U+00E4        U+201C
HTML escapes                  &auml;        &ldquo;
                              &#xE4;        &#x201C;

<p>Anton&#xED;n Dvo&#x159;&#xE1;k</p>
<p>Antonín Dvořák</p>
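How a numeric escape turns back into a character can be sketched as follows. The function `decodeNumericReferences` is a hypothetical helper written for this illustration; it expands only hexadecimal and decimal numeric references and leaves named ones such as `&ndash;` untouched, which a real HTML parser would also resolve.

```javascript
// Hypothetical helper: expands hexadecimal (&#x...;) and decimal (&#...;)
// numeric character references. Named references are left alone.
function decodeNumericReferences(html) {
  return html.replace(/&#(?:[xX]([0-9a-fA-F]+)|([0-9]+));/g, (match, hex, dec) =>
    String.fromCodePoint(hex !== undefined ? parseInt(hex, 16) : parseInt(dec, 10))
  );
}

console.log(decodeNumericReferences("<p>Anton&#xED;n Dvo&#x159;&#xE1;k</p>"));
// <p>Antonín Dvořák</p>
```

The number in the escape is the Unicode code point itself, independent of the encoding the file is saved in.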


<p>Antonín Dvořák – the most-performed Czech composer worldwide</p>
<p>Antonín Dvořák &ndash; the most-performed Czech composer worldwide</p>

Character       a             ä             “             BOM
Unicode         U+0061        U+00E4        U+201C        U+FEFF
HTML escapes                  &auml;        &ldquo;
                              &#xE4;        &#x201C;
UTF-16 BE       00 61         00 E4         20 1C         FE FF
UTF-16 LE       61 00         E4 00         1C 20         FF FE

Character       a             ä             “             BOM           😝
Unicode         U+0061        U+00E4        U+201C        U+FEFF        U+1F61D
HTML escapes                  &auml;        &ldquo;
                              &#xE4;        &#x201C;                    &#x1F61D;
UTF-16 BE       00 61         00 E4         20 1C         FE FF         D8 3D DE 1D
UTF-16 LE       61 00         E4 00         1C 20         FF FE         3D D8 1D DE
» '😝'.length
← 2
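The length of 2 comes from JavaScript strings counting UTF-16 code units rather than characters. A minimal sketch of the difference, using only standard string methods:

```javascript
const face = "\u{1F61D}"; // 😝, a code point outside the Basic Multilingual Plane

console.log(face.length); // 2 (UTF-16 code units)
console.log([...face].length); // 1 (string iteration goes by code point)
console.log(face.codePointAt(0).toString(16)); // 1f61d

// The two code units are the surrogate pair from the UTF-16 BE row.
console.log(face.charCodeAt(0).toString(16), face.charCodeAt(1).toString(16)); // d83d de1d
```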

Character       a             ä             “             BOM           😝
Unicode         U+0061        U+00E4        U+201C        U+FEFF        U+1F61D
HTML escapes                  &auml;        &ldquo;
                              &#xE4;        &#x201C;                    &#x1F61D;
UTF-16 BE       00 61         00 E4         20 1C         FE FF         D8 3D DE 1D
UTF-16 LE       61 00         E4 00         1C 20         FF FE         3D D8 1D DE
UTF-32 BE       00 00 00 61   00 00 00 E4   00 00 20 1C   00 00 FE FF   00 01 F6 1D

Character       a             ä             “             BOM           😝
Unicode         U+0061        U+00E4        U+201C        U+FEFF        U+1F61D
HTML escapes                  &auml;        &ldquo;
                              &#xE4;        &#x201C;                    &#x1F61D;
UTF-16 BE       00 61         00 E4         20 1C         FE FF         D8 3D DE 1D
UTF-16 LE       61 00         E4 00         1C 20         FF FE         3D D8 1D DE
UTF-32 BE       00 00 00 61   00 00 00 E4   00 00 20 1C   00 00 FE FF   00 01 F6 1D
UTF-8           61            C3 A4         E2 80 9C      EF BB BF      F0 9F 98 9D

character set:
  Character     a             ä             “             BOM           😝
  Unicode       U+0061        U+00E4        U+201C        U+FEFF        U+1F61D
character encoding:
  UTF-16 BE     00 61         00 E4         20 1C         FE FF         D8 3D DE 1D
  UTF-16 LE     61 00         E4 00         1C 20         FF FE         3D D8 1D DE
  UTF-32 BE     00 00 00 61   00 00 00 E4   00 00 20 1C   00 00 FE FF   00 01 F6 1D
  UTF-8         61            C3 A4         E2 80 9C      EF BB BF      F0 9F 98 9D
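The byte rows can be reproduced with a short sketch. `utf16Bytes`, `utf32BEBytes`, and `hex` are hypothetical helpers written for this illustration; `TextEncoder` is the standard API and always produces UTF-8. None of the helpers writes a BOM on its own.

```javascript
// Hypothetical helper: UTF-16 bytes of a string, big or little endian.
function utf16Bytes(text, littleEndian) {
  const bytes = [];
  for (let i = 0; i < text.length; i++) {
    const unit = text.charCodeAt(i); // one 16-bit code unit
    const hi = unit >> 8;
    const lo = unit & 0xff;
    // Byte order applies within each code unit; the units keep their order.
    bytes.push(...(littleEndian ? [lo, hi] : [hi, lo]));
  }
  return bytes;
}

// Hypothetical helper: UTF-32 BE is the code point as four big-endian bytes.
function utf32BEBytes(text) {
  const bytes = [];
  for (const ch of text) { // iterates by code point
    const cp = ch.codePointAt(0);
    bytes.push((cp >>> 24) & 0xff, (cp >>> 16) & 0xff, (cp >>> 8) & 0xff, cp & 0xff);
  }
  return bytes;
}

// Hypothetical helper: formats bytes as upper-case hex pairs.
function hex(bytes) {
  return Array.from(bytes, (b) => b.toString(16).toUpperCase().padStart(2, "0")).join(" ");
}

const sample = "\u{1F61D}"; // 😝
console.log(hex(new TextEncoder().encode(sample))); // F0 9F 98 9D
console.log(hex(utf16Bytes(sample, false))); // D8 3D DE 1D
console.log(hex(utf16Bytes(sample, true))); // 3D D8 1D DE
console.log(hex(utf32BEBytes(sample))); // 00 01 F6 1D
```

The same code point yields four different byte sequences, which is the distinction between the character set (the code point) and the character encoding (the bytes).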

character encoding

HTML: <meta charset="UTF-8"/>
XML: <?xml version="1.0" encoding="UTF-8"?>

C3 A4 → (character encoding) → U+00E4 LATIN SMALL LETTER A WITH DIAERESIS → (font) → ä
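The first step of that pipeline, from bytes to code point, can be sketched with the standard `TextDecoder` API. The misreading shown at the end is an added illustration: it treats each byte as one ISO-8859-1 character, which is what `String.fromCharCode` does for values below 0x100.

```javascript
const bytes = new Uint8Array([0xc3, 0xa4]);

// Character encoding: bytes -> code point.
const text = new TextDecoder("utf-8").decode(bytes);
const codePoint = "U+" + text.codePointAt(0).toString(16).toUpperCase().padStart(4, "0");
console.log(codePoint); // U+00E4

// The same two bytes read as ISO-8859-1 become two characters instead of one.
const misread = String.fromCharCode(...bytes);
console.log(misread); // Ã¤
```

Which glyph is finally drawn for U+00E4 is up to the font; that step is outside what the decoder does.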

OPENTYPE FEATURES

charset ≠ character set

charset = character encoding

The end.