TIL warum nicht GROẞ

A presentation at Tech Talk @ Tagesspiegel by Gunnar Bittersmann

Gunnar Bittersmann
@gunnarbittersmann

Unicode and the capital ẞ

Resources

The following resources were mentioned during the presentation or are useful additional information.

Unicode Character Database: Special Casing

This file is a supplement to the UnicodeData.txt file. It does not define any properties, but rather provides additional information about the casing of Unicode characters, for situations when casing incurs a change in string length or is dependent on context or locale.
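
As a small illustration of a case mapping that changes string length, here is a minimal sketch in Python. It assumes CPython 3, whose `str.upper()` applies Unicode's full case mappings; other languages and runtimes may behave differently.

```python
# Sketch: uppercasing ß (U+00DF) with the default Unicode mapping
# yields two letters, so the string grows by one character.
word = "groß"
upper = word.upper()

print(upper)                  # GROSS
print(len(word), len(upper))  # 4 5

# The capital sharp s, U+1E9E, lowercases back to ß (U+00DF),
# so the round trip is not symmetric with the default rules.
capital_sharp_s = "\u1e9e"
print(capital_sharp_s.lower() == "\u00df")  # True
```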

Q&A Session with Mark Davis, President and Co-Founder of Unicode

Mark Davis answers my question: The ẞ (Latin uppercase sharp S) was added as U+1E9E a while ago. However, CLDR still defines ß (lowercase sharp s) as being uppercased to SS, which seems wrong to me as a native German speaker. Are there any plans to change this behavior yet? What would it take to do so?

Buzz and feedback

Here’s what was said about this presentation on social media.

But @BVG_Kampagne does not have the GRÖẞE (greatness) to use a capital ẞ after all. https://t.co/74Vpna1HPh pic.twitter.com/p2mXOCv1av
— Gunnar Bittersmann (@g16n) December 30, 2015
Why not BESCHLIEẞT, #golem? #VersalEszett https://t.co/Qn84rfFndN pic.twitter.com/uW3XELgW4e
— Warum nicht GROẞ (@NichtGro) January 17, 2021
Because the source code says „beschließt“ and that gets converted with CSS `text-transform: uppercase`.
CSS defers to the Unicode rule, which says that ß is to be transformed to SS. (See @fantasai⁠’s answer to https://t.co/aC46WBOB7o.)
That would be the place that needs changing.
— Gunnar Bittersmann (@g16n) January 17, 2021
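
The behavior discussed in this thread can be reproduced outside CSS. The following is a minimal sketch in Python, assuming CPython 3's `str.upper()`; the helper name `upper_with_capital_eszett` is hypothetical and only illustrates the mapping ß → ẞ argued for here. It is not how any browser or standard implements `text-transform`.

```python
def upper_with_capital_eszett(text: str) -> str:
    """Uppercase text, mapping ß (U+00DF) to ẞ (U+1E9E) instead of SS.

    Hypothetical helper for illustration; not part of any standard.
    """
    # Swap ß for ẞ first, so the default ß -> SS rule never applies.
    # ẞ has no further uppercase mapping and passes through upper() unchanged.
    return text.replace("\u00df", "\u1e9e").upper()


print("beschließt".upper())                     # BESCHLIESST
print(upper_with_capital_eszett("beschließt"))  # BESCHLIEẞT
```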
AFAIS https://t.co/KKiGYo2BTN doesn’t specify the transformation ß→SS, but refers to Unicode https://t.co/fOQsjfbEct.
IMHO, ß U+00DF should be uppercased to ẞ U+1E9E.
Are there any plans to change this behavior/add the option via language subtag `de-…`?
/cc @fantasai @frivoal
— Gunnar Bittersmann (@g16n) November 20, 2020
That's an issue for Unicode / CLDR, not csswg.
— fantasai (@fantasai) November 20, 2020
I’ve asked @mark_e_davis about this. Paraphrasing his answer: Unicode is looking for most-customer usage. ẞ needs to be predominant over SS before they switch over.
So it’s upon us Germans to widely use the capital ẞ first. Chicken or the egg? … [1/2]
— Gunnar Bittersmann (@g16n) September 28, 2022
How would more people use ẞ when constantly presented with SS? Be it because of small ß in the source code being transformed to SS with CSS? https://t.co/sjnkvB8pHR [2/2]
— Gunnar Bittersmann (@g16n) September 28, 2022
From 11:57 in the Q&A with @mark_e_davis https://t.co/smZZGucAR8
— Gunnar Bittersmann (@g16n) November 12, 2022
I presented this to my colleagues at the Tech Talk. Under a fitting title. 😁 https://t.co/z6DgVBWU9k
— Gunnar Bittersmann (@g16n) November 12, 2022
