INDEX
Explanations
symbols or special characters that indicate emphasis
special characters or symbols in the text
New Auto-Interp
Negative Logits
tein
-0.66
pton
-0.65
Spit
-0.64
Spice
-0.63
idine
-0.62
Ultr
-0.61
pher
-0.60
plex
-0.60
iland
-0.59
warts
-0.58
POSITIVE LOGITS
ŀ
1.34
Ĭ
1.15
Ĺ
1.15
ļ
1.05
ĵ
1.01
¿
1.00
Ĩ
0.99
µ
0.99
ł
0.98
³
0.97
Activations Density 0.003%