Table of Contents
If you want to send us your comments, please do so. Thanks
More on comments
Special characters
⋮
▼
⏎
Hex C2A0 is UTF8 for non-breaking space
ISO 8859-1 encodes what it refers to as “Latin alphabet no. 1”, consisting of 191 characters from the Latin script. Hence, also called Latin-1
gucharmap
With the gucharmap program a lot of characters can be shown with their UTF code
This program is not user friendly:
- It shows characters which are not on the system. There is no option to filter these out
- The list is of fonts is not sorted, nor classified
- There is no search function
Composekeys
Configuration
Add to /home/user/.xinitrc
setxkbmap -option compose:lwin
Add to /home/user/.xsession
source ~/.xinitrc & wmpid=$! xfce4-session wait $wmpid
wait might be replaced with waitpid for more readability (not tested)
Log out of Xwindows and log in again
Now the compose key is the left windows key
To use it, press it, release it and then enter the desired key sequence. For example
e =
which gives the € sign. Or do
compose key 1 2
for the ½ sign
More examples
e= → € l- → £ e" → ë "e → ë !e → ẹ ^o → ô /o → ø !? → ‽ ss → ß ^2 → ² ^3 → ³ oo → ° xx → × .. → … --- → — // → \ >> → » <- → ← -> → → =/ → ≠ |c → ¢ 12 → ½
Tool
symbol2unicode Generate unicode symbols from similar ascii character combinations
UTF-8
UTF-8 is backward compatible with ASCII. It uses a specific structure to differentiate between one-byte (ASCII) characters and multi-byte characters.
- Single-byte Characters (ASCII)
- In UTF-8, the first 128 characters (0 to 127) are identical to ASCII
- A single-byte UTF-8 character has its most significant bit (MSB) set to 0. Any byte with a value from '0x00' to '0x7F' represents an ASCII character
- Example: ASCII 'A' (decimal 65) is in UTF-8 '0x41' which is in decimal 65 and '01000001' in binary
- Multi-byte Characters
- UTF-8 uses a variable-length encoding scheme. Characters beyond the ASCII range are represented using two, three, or four bytes
- The first byte of a multi-byte character has its MSB set to 1
- The number of leading 1 bits in the first byte indicates how many bytes are used for that character
- Encoding Structure
- 1-byte (ASCII): '0xxxxxxx' (0-127)
- 2-byte: '110xxxxx 10xxxxxx'
- 3-byte: '1110xxxx 10xxxxxx 10xxxxxx'
- 4-byte: '11110xxx 10xxxxxx 10xxxxxx 10xxxxxx'
Enter Uni-code characters
Use CTRL-Shift-u Uni-code-character-hex-code
Use the uni-code character hex code
You might do CTRL-Shift-u and then release the keys an enter the code like 3bb or it might be needed to press and hold CTRL-Shift press 'u', enter the code like 3bb and then release the CTRL-Shift keys
Example:
CTRL-Shift-u 3bb -> λ
The leading zero's can be omitted
Do not use the SHIFT key
| Code | C | Name | Remark |
|---|---|---|---|
| 00b0 | ° | Degree | As in 25 °C |
| 1d49 | ᵉ | Superscript e | |
| 2082 | ₂ | Subscript 2 | As in H₂O |
| 2091 209B 1D631) | ₑₛᵣ | Subscript ₑₛᵣ. ESR as in Rₑₛᵣ. Unicode does not provide a dedicated subscript character for r. The Phonetic Modifier Letter Small r can be used |
UTF codes
| # | Character | Unicode name | Code point |
|---|---|---|---|
| 1 | © | COPYRIGHT SIGN | U+00A9 |
| 2 | ® | REGISTERED SIGN | U+00AE |
| 3 | ™ | TRADE MARK SIGN | U+2122 |
| 4 | § | SECTION SIGN | U+00A7 |
| 5 | ¶ | PILCROW SIGN (PARAGRAPH) | U+00B6 |
| 6 | † | DAGGER | U+2020 |
| 7 | ‡ | DOUBLE DAGGER | U+2021 |
| 8 | ‰ | PER MILLE SIGN | U+2030 |
| 9 | ‱ | PER TEN THOUSAND SIGN | U+2031 |
| 10 | ★ | BLACK STAR | U+2605 |
| 11 | ☆ | WHITE STAR | U+2606 |
| 12 | ☀︎ | BLACK SUN WITH RAYS | U+2600 |
| 13 | ☁︎ | CLOUD | U+2601 |
| 14 | ☂︎ | UMBRELLA | U+2602 |
| 15 | ☃︎ | SNOWMAN | U+2603 |
| 16 | ☎︎ | TELEPHONE | U+260E |
| 17 | ☏ | TELEPHONE LOCATION SIGN | U+260F |
| 18 | ☑︎ | BALLOT BOX WITH CHECK | U+2611 |
| 19 | ☒ | BALLOT BOX WITH X | U+2612 |
| 20 | ☕︎ | HOT BEVERAGE | U+2615 |
| 21 | ♠︎ | BLACK SPADE SUIT | U+2660 |
| 22 | ♣︎ | BLACK CLUB SUIT | U+2663 |
| 23 | ♥︎ | BLACK HEART SUIT | U+2665 |
| 24 | ♦︎ | BLACK DIAMOND SUIT | U+2666 |
| 25 | ♭ | MUSIC FLAT SIGN | U+266D |
| 26 | ♮ | MUSIC NATURAL SIGN | U+266E |
| 27 | ♯ | MUSIC SHARP SIGN | U+266F |
| 28 | ✓ | CHECK MARK | U+2713 |
| 29 | ✗ | BALLOT X | U+2717 |
| 30 | ✈︎ | AIRPLANE | U+2708 |
| 31 | ✉︎ | ENVELOPE | U+2709 |
| 32 | ✂︎ | SCISSORS | U+2702 |
| 33 | ✏︎ | PENCIL | U+270F |
| 34 | ✒︎ | BLACK NIB | U+2712 |
| 35 | ✨ | SPARKLES | U+2728 |
| 36 | ❄︎ | SNOWFLAKE | U+2744 |
| 37 | ❝ | HEAVY DOUBLE TURNED COMMA QUOTATION MARK ORNAMENT | U+275D |
| 38 | ❞ | HEAVY DOUBLE COMMA QUOTATION MARK ORNAMENT | U+275E |
| 39 | ❢ | HEAVY HEART EXCLAMATION MARK SYMBOL | U+2762 |
| 40 | ❣︎ | HEAVY HEART EXCLAMATION MARK SYMBOL | U+2763 |
| 41 | ➔ | RIGHTWARDS ARROW | U+2192 |
| 42 | ← | LEFTWARDS ARROW | U+2190 |
| 43 | ↑ | UPWARDS ARROW | U+2191 |
| 44 | ↓ | DOWNWARDS ARROW | U+2193 |
| 45 | ↔︎ | LEFT RIGHT ARROW | U+2194 |
| 46 | ↕︎ | UP DOWN ARROW | U+2195 |
| 47 | ⇧ | UPWARDS WHITE ARROW | U+21E7 |
| 48 | ⇩ | DOWNWARDS WHITE ARROW | U+21E9 |
| 49 | ⟶ | LONG RIGHTWARDS ARROW | U+27F6 |
| 50 | ⟵ | LONG LEFTWARDS ARROW | U+27F5 |
| 51 | ⟺ | LONG LEFT RIGHT ARROW | U+27FA |
| 52 | ∞ | INFINITY | U+221E |
| 53 | ≈ | ALMOST EQUAL TO | U+2248 |
| 54 | ≠ | NOT EQUAL TO | U+2260 |
| 55 | ≤ | LESS‑THAN OR EQUAL TO | U+2264 |
| 56 | ≥ | GREATER‑THAN OR EQUAL TO | U+2265 |
| 57 | ± | PLUS‑MINUS SIGN | U+00B1 |
| 58 | ÷ | DIVISION SIGN | U+00F7 |
| 59 | × | MULTIPLICATION SIGN | U+00D7 |
| 60 | ∂ | PARTIAL DIFFERENTIAL | U+2202 |
| 61 | ∑ | N-ARY SUMMATION | U+2211 |
| 62 | √ | SQUARE ROOT | U+221A |
| 63 | ∫ | INTEGRAL | U+222B |
| 64 | ∇ | NABLA (del) | U+2207 |
| 65 | ≈ | APPROXIMATELY EQUAL TO | U+2248 |
| 66 | Ω | OHM SIGN | U+2126 |
| 67 | µ | MICRO SIGN | U+00B5 |
| 68 | € | EURO SIGN | U+20AC |
| 69 | £ | POUND SIGN | U+00A3 |
| 70 | ¥ | YEN SIGN | U+00A5 |
| 71 | ¢ | CENT SIGN | U+00A2 |
| 72 | ‽ | INTERROBANG | U+203D |
| 73 | ⁂ | ASTERISM | U+2042 |
| 74 | ⁍ | HYPHEN BULLET | U+204D |
| 75 | ⁎ | LOW ASTERISK | U+204E |
| 76 | ⸮ | INTERROBANG WITH QUESTION MARK | U+2E2E |
| 77 | ⍰ | QUESTIONED SYMBOL | U+2370 |
| 78 | ⍟ | APL FUNCTIONAL SYMBOL CIRCLE STILE | U+235F |
| 79 | ⧉ | LIGHTNING BOLT | U+29C9 |
| 80 | ⧫ | BLACK DIAMOND MINUS WHITE X | U+29EB |
| 81 | ⧭ | LEFT RIGHT OPEN-HEADED ARROW | U+29ED |
| 82 | ⧖ | LEFTWARDS ARROW OVER RIGHTWARDS ARROW | U+2956 |
| 83 | ⧘ | LEFTWARDS ARROW THROUGH SMALL CIRCLE | U+2958 |
| 84 | ⧚ | LEFTWARDS ARROW OVER LEFTWARDS ARROW | U+295A |
| 85 | ⧜ | LEFTWARDS ARROW OVER RIGHTWARDS ARROW | U+295C |
| 86 | ⧞ | LEFTWARDS ARROW OVER LEFTWARDS ARROW | U+295E |
| 87 | ⧠ | WHITE SQUARE WITH VERTICAL BISECTING LINE | U+2960 |
| 88 | ⧢ | WHITE DIAGONAL CROSS | U+2962 |
| 89 | ⧤ | WHITE DIAGONAL CROSS FROM UPPER LEFT TO LOWER RIGHT | U+2964 |
| 90 | ⧦ | WHITE DIAGONAL CROSS FROM UPPER RIGHT TO LOWER LEFT | U+2966 |
| 91 | ⧨ | WHITE DIAGONAL CROSS FROM LOWER LEFT TO UPPER RIGHT | U+2968 |
| 92 | ⧪ | WHITE DIAGONAL CROSS FROM LOWER RIGHT TO UPPER LEFT | U+296A |
| 93 | ⧬ | WHITE DIAGONAL CROSS FROM CENTER TO EAST | U+296C |
| 94 | ⧮ | WHITE DIAGONAL CROSS FROM CENTER TO WEST | U+296E |
| 95 | ⧰ | WHITE DIAGONAL CROSS FROM CENTER TO NORTH | U+2970 |
| 96 | ⧲ | WHITE DIAGONAL CROSS FROM CENTER TO SOUTH | U+2972 |
| 97 | ⧴ | WHITE DIAGONAL CROSS FROM CENTER TO NORTHEAST | U+2974 |
| 98 | ⧶ | WHITE DIAGONAL CROSS FROM CENTER TO SOUTHEAST | U+2976 |
| 99 | ⧸ : | DIVISION SLASH | U+29F8 |
| 100 | ⧹ | REVERSE SOLIDUS | U+29F9 |
| 101 | ♥ | Heavy black heart | U+2764 |
| 102 | ★ | White star | U+2606 |
| 103 | ☯ | Yin yang | U+262F |
| 104 | ☮ | Peace symbol | U+262E |
| 105 | ⚡ | High voltage sign | U+26A1 |
| 106 | ✿ | Black florette | U+273F |
| 107 | ☘ | Shamrock | U+2618 |
| 108 | 💖 | Sparkling heart | U+1F496 |
| 109 | 🌟 | Glowing star | U+1F31F |
| 110 | 🌼 | Blossom | U+1F33C |
| 111 | 🌈 | Rainbow | U+1F308 |
| 112 | 🎉 | Party popper | U+1F389 |
| 113 | 🔥 | Fire | U+1F525 |
| 114 | 〰 | Wavy dash | U+3030 |
| 115 | 𝓦 | Script capital W | U+1D49C7 |
| 116 | 𝔚 | Fraktur capital W | U+1D51DB |
Work in progress
⃤ U20E4 ➊ U278A UE454 ⧗ U29D7 ⨂ U2A02 ⬛ U2B1B ⬤ U2B24 ⮚ U2B9A UE021 UE02A UE407 🕿 U1F57F 🖂 U1F582 🙼 U1F67C 🚘 U1F698 UE425 UE43F UE033 ▇ U2587 ⏰ U23F0 🁻 U1F07B 🂱 1F0B1
HTML
HTML characters can be used in documents via copy and paste. For example arrows. The method:
More characters
unicode.org
Alt-Codes, square signs and more
Compose key cheat sheet
How add more
Entering special characters in Vim
Symbol > unicode
Test for Unicode support in Web browsers (very much characters)
Linux keyboard text symbols: Compose key shortcuts
Compose key
ComposeKeys
XCOMM UTF-8 (Unicode) compose sequence
UTF-8 and Unicode FAQ for Unix/Linux
ComposeKey - Ubuntu documentation
More general information
Keyboard key bindings
/home/user/.Xmodmap
Example
keycode 26 = e E eacute Eacute EuroSign cent EuroSign
To get an idea how to change keybindings see: xmodmap nl
xbindkeys
The XMacro package contains two simple, C++ programs (xmacrorec and xmacroplay) for recording and replaying keyboard and mouse events on an X server
Keybindings for capslock
Add to /etc/default/keyboard (not tested) in stead of to .initrc / .xsessions
XKBOPTIONS="compose:lwin"
Using the caps key as compose key
setxkbmap -option compose:caps
Other information
keysym 0xffe9, Alt_L keysym 0xffea, Alt_R
Files
/home/user/.config/xfce4/xfconf/xfce-perchannel-xml/xfce4-keyboard-shortcuts.xml
Coding conversion
When one program needs an other coding for its input use something like
cat file.txt | iconv -c --from-code=ISO-8859-1 --to-code=UTF-8
-c Silently discard characters that cannot be converted instead of terminating when encountering such characters
UTF-8 not decoded
The xfce4-terminal 0.8.7.4 program is set to 'Default (UTF-8)' (see via Edit > Advanced tab > Encoding > Default character encoding)
When you get something like file�<80><99>s in a terminal try (rsync is an example):
- Run update-locale
- man rsycnc
(man rsync is an example)
If it does not resolve the issue try
LC_CTYPE=UTF-8; man rsync
Then we got this warning
bash: warning: setlocale: LC_CTYPE: cannot change locale (UTF-8): No such file or directory man: can't set the locale; make sure $LC_* and $LANG are correct
However in the output of man rsync all characters looked fine
If not, close the man page en run the man rsync again
If this does not make the characters readable try LC_CTYPE=C to make all characters readable. Example
LC_CTYPE=C; man rsync
Emoji's
G: Graphic emoji
T: Text emoji
| G | T | Explanation | Remark |
|---|---|---|---|
| 😐 | :-| | Neutral face | |
| Party time. An expression of happiness | |||
| Releived | |||
| You are welcome There is no specific emoji for this. A combination of 🙏 (folded hands) which is often used to express gratitude or a welcoming gesture, 😊 (smiling face with smiling eyes) which conveys friendliness and warmth and 🤗 (hugging face) which represents a warm welcome or embrace can be used | |||
| Zum kotsen. An expression of great dissatisfaction and anger2) |
Useful links
Main subjects on this wiki: Linux, Debian, HTML, Microcontrollers, Privacy
RSS
Disclaimer
Privacy statement
Bugs statement
Cookies
Copyright © : 2014 - 2026 Webevaluation.nl and the authors
Changes reserved.
