Characters
Code Points
UTF-16 Units
UTF-8 Bytes

Try an example!

šŸ§‘šŸ¾ā€ā¤ļøā€šŸ’‹ā€šŸ§‘šŸ» The most complex emoji in the current Unicode standard is composd of 10 code points including skin color modifiers, zero-width joiners, and a variation selector.
SĢ¶tĢ¶rĢ¶iĢ¶kĢ¶eĢ¶Š¾š˜‚š˜ See how combining characters and misusing unusual characters can be used to create interesting text effects and homographs.
ƅ != AĢŠ Learn about composing characters and normalized forms.
ā€®12345ā€¬ This text renders backwards from the order of its characters using BIDI control code points. Inspired by https://trojansource.codes/.
Hi! ‏(שלום!)‏ This example contains bidirectional text with BIDI glyph mirroring and right-to-left markers. Inspired by https://blog.georeactor.com/osm-1.
(שלום!) This bidirectional text displays differently depending on context. Inspired by https://blog.georeactor.com/osm-1.
ā†™ ~ ā†™ļø and 你儽ļ¼ ~ 你儽!︁ Examples of an emoji variation sequence and an East Asian punctuation positional variant using variation selectors.

Send me other interesting Unicode examples at @josh@joshdata.me on Mastodon.


About Unicode.run

Text is unexpectedly complicated. Use Unicode.run to debug text.

Here are some things you can do here:

This is a project by JoshData.

Thanks to ucd-full (based on Unicode 15.1), stdlib-js/string-split-grapheme-clusters (based on Unicode 13), bidi-js (based on Unicode 13), html-entities, and the Inter Typeface.

Nikita Prokopov’s The Absolute Minimum Every Software Developer Must Know About Unicode in 2023 (Still No Excuses!) was inspiration for this project.