INDEX
Explanations
monetary values or currency symbols
New Auto-Interp
Negative Logits
s
-0.33
$$$$
-0.32
h
-0.28
i
-0.26
m
-0.25
c
-0.24
r
-0.24
¨
-0.24
$$$
-0.23
$↵
-0.23
POSITIVE LOGITS
æĬ
0.14
w
0.13
Duy
0.13
çī
0.13
tol
0.13
olest
0.13
Kling
0.13
enz
0.13
mlin
0.13
Midi
0.13
Activations Density 0.057%