xref: /openbsd-src/regress/usr.bin/mandoc/char/unicode/input.in (revision f2da64fbbbf1b03f09f390ab01267c93dfd77c4c)
CHAR-UNICODE-INPUT 1 "December 19, 2014" OpenBSD
NAME
char-unicode-input - Unicode characters in the input file
DESCRIPTION
lowest valid: €
One-byte range
U+0000:0x00:\[u0000]�:lowest ASCII
U+001f:0x1f:\[u001F]:highest ASCII control character
U+007f:0x7f:\[u007F]:highest ASCII
:0x80:�:leading lowest continuation
:0xbf:�:leading highest continuation
Two-byte range
U+0000:0xc080:��:lowest obfuscated ASCII
U+007f:0xc1bf:��:highest obfuscated ASCII
:0xc278:�x:ASCII continuation
U+0080:0xc280:\[u0080]€:lowest two-byte
:0xc2c380:�À:high continuation
U+07FF:0xdfbf:\[u07FF]߿:highest two-byte
Three-byte range
U+0000:0xe08080:���:lowest obfuscated ASCII
U+007f:0xe081bf:���:highest obfuscated ASCII
U+0080:0xe08280:���:lowest obfuscated two-byte
U+07FF:0xe09fbf:���:highest obfuscated two-byte
U+0800:0xe0a080:\[u0800]ࠀ:lowest three-byte
U+0FFF:0xe0bfbf:\[u0FFF]࿿:end of first middle byte
U+1000:0xe18080:\[u1000]က:begin of second middle byte
U+CFFF:0xecbfbf:\[uCFFF]쿿:end of last normal middle byte
U+D000:0xed8080:\[uD000]퀀:begin of strange middle byte
U+D7FF:0xed9fbf:\[uD7FF]퟿:highest public three-byte
U+D800:0xeda080:\[uD800]�:lowest surrogate
U+DFFF:0xedbfbf:\[uDFFF]�:highest surrogate
U+E000:0xee8080:\[uE000]:lowest private use
U+FFFF:0xefbfbf:\[uFFFF]￿:highest three-byte
Four-byte range
U+0000:0xf0808080:����:lowest obfuscated ASCII
U+007f:0xf08081bf:����:highest obfuscated ASCII
U+0080:0xf0808280:����:lowest obfuscated two-byte
U+07FF:0xf0809fbf:����:highest obfuscated two-byte
U+0800:0xf080a080:����:lowest obfuscated three-byte
U+FFFF:0xf08fbfbf:����:highest obfuscated three-byte
U+10000:0xf0908080:\[u10000]��:lowest four-byte
U+3FFFF:0xf0bfbfbf:\[u3FFFF]��:end of first middle byte
U+40000:0xf1808080:\[u40000]��:second middle byte
U+FFFFF:0xf3bfbfbf:\[uFFFFF]��:last normal middle byte
U+100000:0xf4808080:\[u100000]��:strange middle byte
U+10FFFF:0xf48fbfbf:\[u10FFFF]��:last valid four-byte
U+110000:0xf4908080:\[u110000]����:lowest beyond Unicode
U+13FFFF:0xf4bfbfbf:\[u13FFFF]����:end of strange middle byte
U+140000:0xf5808080:\[u140000]����:lowest invalid middle byte
U+1FFFFF:0xf7bfbfbf:\[u1FFFFF]����:highest four-byte
U+200000:0xf888808080:\[u200000]�����:lowest five-byte