User Tools

Site Tools


characters

If you want to send us your comments, please do so. Thanks
More on comments


Special characters




Hex C2A0 is UTF8 for non-breaking space

ISO 8859-1 encodes what it refers to as “Latin alphabet no. 1”, consisting of 191 characters from the Latin script. Hence, also called Latin-1

gucharmap

With the gucharmap program a lot of characters can be shown with their UTF code
This program is not user friendly:

  • It shows characters which are not on the system. There is no option to filter these out
  • The list is of fonts is not sorted, nor classified
  • There is no search function

Composekeys

Configuration

Add to /home/user/.xinitrc

setxkbmap -option compose:lwin

Add to /home/user/.xsession

source ~/.xinitrc &
wmpid=$!
xfce4-session
wait $wmpid

wait might be replaced with waitpid for more readability (not tested)
Log out of Xwindows and log in again
Now the compose key is the left windows key
To use it, press it, release it and then enter the desired key sequence. For example

e
=

which gives the € sign. Or do

compose key 
1
2

for the ½ sign

More examples

e= → €
l- → £
e" → ë "e → ë
!e → ẹ
^o → ô
/o → ø
!? → ‽
ss → ß
^2 → ²
^3 → ³
oo → °
xx → ×
.. → …
--- → —
// → \
>> → »
<- → ← 
-> → →
=/ → ≠
|c → ¢
12 → ½

Tool

symbol2unicode Generate unicode symbols from similar ascii character combinations

UTF-8

UTF-8 is backward compatible with ASCII. It uses a specific structure to differentiate between one-byte (ASCII) characters and multi-byte characters.

  • Single-byte Characters (ASCII)
    • In UTF-8, the first 128 characters (0 to 127) are identical to ASCII
    • A single-byte UTF-8 character has its most significant bit (MSB) set to 0. Any byte with a value from '0x00' to '0x7F' represents an ASCII character
    • Example: ASCII 'A' (decimal 65) is in UTF-8 '0x41' which is in decimal 65 and '01000001' in binary
  • Multi-byte Characters
    • UTF-8 uses a variable-length encoding scheme. Characters beyond the ASCII range are represented using two, three, or four bytes
    • The first byte of a multi-byte character has its MSB set to 1
    • The number of leading 1 bits in the first byte indicates how many bytes are used for that character
    • Encoding Structure
      • 1-byte (ASCII): '0xxxxxxx' (0-127)
      • 2-byte: '110xxxxx 10xxxxxx'
      • 3-byte: '1110xxxx 10xxxxxx 10xxxxxx'
      • 4-byte: '11110xxx 10xxxxxx 10xxxxxx 10xxxxxx'

Enter Uni-code characters

Use CTRL-Shift-u Uni-code-character-hex-code

Use the uni-code character hex code
You might do CTRL-Shift-u and then release the keys an enter the code like 3bb or it might be needed to press and hold CTRL-Shift press 'u', enter the code like 3bb and then release the CTRL-Shift keys

Example:

CTRL-Shift-u 3bb -> λ

The leading zero's can be omitted
Do not use the SHIFT key

CodeCNameRemark
00b0°DegreeAs in 25 °C
1d49Superscript e
2082Subscript 2As in H₂O
2091
209B
1D631)
ₑₛᵣSubscript ₑₛᵣ. ESR as in Rₑₛᵣ. Unicode does not provide a dedicated subscript character for r. The Phonetic Modifier Letter Small r can be used

UTF codes

#CharacterUnicode nameCode point
1©COPYRIGHT SIGNU+00A9
2®REGISTERED SIGNU+00AE
3TRADE MARK SIGNU+2122
4§SECTION SIGNU+00A7
5PILCROW SIGN (PARAGRAPH)U+00B6
6DAGGERU+2020
7DOUBLE DAGGERU+2021
8PER MILLE SIGNU+2030
9PER TEN THOUSAND SIGNU+2031
10BLACK STARU+2605
11WHITE STARU+2606
12☀︎BLACK SUN WITH RAYSU+2600
13☁︎CLOUDU+2601
14☂︎UMBRELLAU+2602
15☃︎SNOWMANU+2603
16☎︎TELEPHONEU+260E
17TELEPHONE LOCATION SIGNU+260F
18☑︎BALLOT BOX WITH CHECKU+2611
19BALLOT BOX WITH XU+2612
20☕︎HOT BEVERAGEU+2615
21♠︎BLACK SPADE SUITU+2660
22♣︎BLACK CLUB SUITU+2663
23♥︎BLACK HEART SUITU+2665
24♦︎BLACK DIAMOND SUITU+2666
25MUSIC FLAT SIGNU+266D
26MUSIC NATURAL SIGNU+266E
27MUSIC SHARP SIGNU+266F
28CHECK MARKU+2713
29BALLOT XU+2717
30✈︎AIRPLANEU+2708
31✉︎ENVELOPEU+2709
32✂︎SCISSORSU+2702
33✏︎PENCILU+270F
34✒︎BLACK NIBU+2712
35SPARKLESU+2728
36❄︎SNOWFLAKEU+2744
37HEAVY DOUBLE TURNED COMMA QUOTATION MARK ORNAMENTU+275D
38HEAVY DOUBLE COMMA QUOTATION MARK ORNAMENTU+275E
39HEAVY HEART EXCLAMATION MARK SYMBOLU+2762
40❣︎HEAVY HEART EXCLAMATION MARK SYMBOLU+2763
41RIGHTWARDS ARROWU+2192
42LEFTWARDS ARROWU+2190
43UPWARDS ARROWU+2191
44DOWNWARDS ARROWU+2193
45↔︎LEFT RIGHT ARROWU+2194
46↕︎UP DOWN ARROWU+2195
47UPWARDS WHITE ARROWU+21E7
48DOWNWARDS WHITE ARROWU+21E9
49LONG RIGHTWARDS ARROWU+27F6
50LONG LEFTWARDS ARROWU+27F5
51LONG LEFT RIGHT ARROWU+27FA
52INFINITYU+221E
53ALMOST EQUAL TOU+2248
54NOT EQUAL TOU+2260
55LESS‑THAN OR EQUAL TOU+2264
56GREATER‑THAN OR EQUAL TOU+2265
57±PLUS‑MINUS SIGNU+00B1
58÷DIVISION SIGNU+00F7
59×MULTIPLICATION SIGNU+00D7
60PARTIAL DIFFERENTIALU+2202
61N-ARY SUMMATIONU+2211
62SQUARE ROOTU+221A
63INTEGRALU+222B
64NABLA (del)U+2207
65APPROXIMATELY EQUAL TOU+2248
66OHM SIGNU+2126
67µMICRO SIGNU+00B5
68EURO SIGNU+20AC
69£POUND SIGNU+00A3
70¥YEN SIGNU+00A5
71¢CENT SIGNU+00A2
72INTERROBANGU+203D
73ASTERISMU+2042
74HYPHEN BULLETU+204D
75LOW ASTERISKU+204E
76INTERROBANG WITH QUESTION MARKU+2E2E
77QUESTIONED SYMBOLU+2370
78APL FUNCTIONAL SYMBOL CIRCLE STILEU+235F
79LIGHTNING BOLTU+29C9
80BLACK DIAMOND MINUS WHITE XU+29EB
81LEFT RIGHT OPEN-HEADED ARROWU+29ED
82LEFTWARDS ARROW OVER RIGHTWARDS ARROWU+2956
83LEFTWARDS ARROW THROUGH SMALL CIRCLEU+2958
84LEFTWARDS ARROW OVER LEFTWARDS ARROWU+295A
85LEFTWARDS ARROW OVER RIGHTWARDS ARROWU+295C
86LEFTWARDS ARROW OVER LEFTWARDS ARROWU+295E
87WHITE SQUARE WITH VERTICAL BISECTING LINEU+2960
88WHITE DIAGONAL CROSSU+2962
89WHITE DIAGONAL CROSS FROM UPPER LEFT TO LOWER RIGHTU+2964
90WHITE DIAGONAL CROSS FROM UPPER RIGHT TO LOWER LEFTU+2966
91WHITE DIAGONAL CROSS FROM LOWER LEFT TO UPPER RIGHTU+2968
92WHITE DIAGONAL CROSS FROM LOWER RIGHT TO UPPER LEFTU+296A
93WHITE DIAGONAL CROSS FROM CENTER TO EASTU+296C
94WHITE DIAGONAL CROSS FROM CENTER TO WESTU+296E
95WHITE DIAGONAL CROSS FROM CENTER TO NORTHU+2970
96WHITE DIAGONAL CROSS FROM CENTER TO SOUTHU+2972
97WHITE DIAGONAL CROSS FROM CENTER TO NORTHEASTU+2974
98WHITE DIAGONAL CROSS FROM CENTER TO SOUTHEASTU+2976
99⧸ :DIVISION SLASHU+29F8
100REVERSE SOLIDUSU+29F9
101Heavy black heartU+2764
102White starU+2606
103Yin yangU+262F
104Peace symbolU+262E
105High voltage signU+26A1
106Black floretteU+273F
107ShamrockU+2618
108💖Sparkling heartU+1F496
109🌟Glowing starU+1F31F
110🌼BlossomU+1F33C
111🌈RainbowU+1F308
112🎉Party popperU+1F389
113🔥FireU+1F525
114Wavy dashU+3030
115𝓦Script capital WU+1D49C7
116𝔚Fraktur capital WU+1D51DB

Work in progress

⃤ U20E4 ➊ U278A  UE454 ⧗ U29D7 ⨂ U2A02 ⬛ U2B1B ⬤ U2B24 ⮚ U2B9A  UE021  UE02A  UE407 🕿 U1F57F 🖂 U1F582 🙼 U1F67C 🚘 U1F698  UE425  UE43F  UE033 ▇ U2587 ⏰ U23F0 🁻 U1F07B 🂱 1F0B1

HTML

HTML characters can be used in documents via copy and paste. For example arrows. The method:

  • Select a character on the arrows page
  • Enter the code in W3schools try it page (replace the Euro &#x20AC; code)
  • Click on the 'Run' button
  • Copy the character on the right side
  • Past the character in the document

More characters

More general information

Keyboard key bindings

/home/user/.Xmodmap

Example

keycode  26 = e E eacute Eacute EuroSign cent EuroSign

To get an idea how to change keybindings see: xmodmap nl
xbindkeys
The XMacro package contains two simple, C++ programs (xmacrorec and xmacroplay) for recording and replaying keyboard and mouse events on an X server

Keybindings for capslock

Add to /etc/default/keyboard (not tested) in stead of to .initrc / .xsessions

XKBOPTIONS="compose:lwin"

Using the caps key as compose key

setxkbmap -option compose:caps

Other information

keysym 0xffe9, Alt_L
keysym 0xffea, Alt_R

Files

/home/user/.config/xfce4/xfconf/xfce-perchannel-xml/xfce4-keyboard-shortcuts.xml

Coding conversion

When one program needs an other coding for its input use something like

cat file.txt | iconv -c --from-code=ISO-8859-1 --to-code=UTF-8

-c Silently discard characters that cannot be converted instead of terminating when encountering such characters

UTF-8 not decoded

The xfce4-terminal 0.8.7.4 program is set to 'Default (UTF-8)' (see via Edit > Advanced tab > Encoding > Default character encoding)
When you get something like file�<80><99>s in a terminal try (rsync is an example):

  • Set the locale as in this example of a locale file (change what is needed for your local needs)
  • Run update-locale
  • man rsycnc

(man rsync is an example)
If it does not resolve the issue try

LC_CTYPE=UTF-8; man rsync

Then we got this warning

bash: warning: setlocale: LC_CTYPE: cannot change locale (UTF-8): No such file or directory
man: can't set the locale; make sure $LC_* and $LANG are correct

However in the output of man rsync all characters looked fine
If not, close the man page en run the man rsync again

If this does not make the characters readable try LC_CTYPE=C to make all characters readable. Example

LC_CTYPE=C; man rsync

Emoji's

G: Graphic emoji
T: Text emoji

GTExplanationRemark
😐:-|Neutral face
Party time. An expression of happiness
Releived
You are welcome
There is no specific emoji for this. A combination of 🙏 (folded hands) which is often used to express gratitude or a welcoming gesture, 😊 (smiling face with smiling eyes) which conveys friendliness and warmth and 🤗 (hugging face) which represents a warm welcome or embrace can be used
Zum kotsen. An expression of great dissatisfaction and anger2)

unicode-6.0
List_of_emoticons

Hex editing
Archwiki xmodmap


Main subjects on this wiki: Linux, Debian, HTML, Microcontrollers, Privacy

RSS
Disclaimer
Privacy statement
Bugs statement
Cookies
Copyright © : 2014 - 2026 Webevaluation.nl and the authors
Changes reserved.

1)
Phonetic Modifier Letter Small r (ᵣ)
2)
Ein Ausdruck großer Unzufriedenheit und Wut
characters.txt · Last modified: by wim