hackernews client

jech

7 hours ago

That was a long time ago.

Traditionally, character's under Unix were encoded in a locale-specific manner: ISO 8859-1 in Western Europe, ISO 8859-2 in Eastern Europe, EUC-JP in Japan, etc. In the 1990s, there was a major push to get XFree86 (the ancestor of X.Org) to switch to locale-independent UTF-8, lead mainly by Markus Kuhn and Bruno Haible.

The link is to Markus Kuhn's web page, which appears to describe the UTF_8 software available around 1998 or so.

sheept

5 hours ago

UTF-8 is not locale independent. You cannot correctly render multilingual UTF-8 text without also specifying its locale, and some transformations like uppercase/lowercase also depend on the locale.

Joker_vD

4 hours ago

> You cannot correctly render multilingual UTF-8 text without also specifying its locale

You can render it pretty well, not perfect, but good enough to actually read it, as opposed to not being able to render it at all or rendering mojibake à la РљСЂР°РєРѕР·СЏР±СЂС‹ instead.

Unicode Fonts and Tools for X11

7 Comments

jech

sheept

Joker_vD

numpad0

sourcegrift

j16sdiz

ufocia