INDEX
Explanations
negative phrases or expressions indicating doubt or lack of certainty
New Auto-Interp
Negative Logits
reen
-0.15
ìķĦëĭĪëĿ¼
-0.14
horribly
-0.14
å¹¶ä¸į
-0.14
avin
-0.14
olini
-0.13
ouv
-0.13
nonzero
-0.13
inia
-0.13
somehow
-0.13
POSITIVE LOGITS
any
0.25
anymore
0.24
spared
0.19
much
0.19
Any
0.18
Any
0.18
less
0.18
anyhow
0.18
any
0.18
anybody
0.17
Activations Density 0.221%