INDEX
Explanations
negative expressions of inability or prohibition
New Auto-Interp
Negative Logits
izon
-0.15
¼åIJĪ
-0.15
inding
-0.14
oque
-0.14
IMS
-0.14
gree
-0.13
δÏħ
-0.13
clearing
-0.13
sm
-0.13
Matches
-0.13
POSITIVE LOGITS
unm
0.16
freopen
0.15
distributed
0.15
orio
0.15
Cls
0.14
embroid
0.14
icha
0.14
ROTO
0.14
VERRIDE
0.14
yonel
0.14
Activations Density 0.036%