INDEX
Explanations
negative evaluations of actions and situations
Negative Logits
ActionBar  -0.16
IMITIVE    -0.15
rine       -0.15
ký         -0.15
PRETTY     -0.15
azo        -0.14
Rope       -0.14
ihan       -0.14
ipse       -0.14
PILE       -0.14
Positive Logits
nor      0.20
anymore  0.17
nor      0.17
asco     0.15
ono      0.15
neither  0.15
Nor      0.14
etchup   0.14
Nor      0.14
جات      0.14
Activations Density 0.407%