INDEX
Explanations
descriptions or stories related to suffering or unfortunate events
instances of suffering or distress related to individuals
New Auto-Interp
Negative Logits
autions
-0.57
+.
-0.53
ometimes
-0.53
emonium
-0.51
accordingly
-0.50
.:
-0.49
nonetheless
-0.49
obin
-0.49
cellaneous
-0.48
etheless
-0.47
POSITIVE LOGITS
..."
0.80
)",
0.76
)</
0.70
)"
0.69
"))
0.67
©¶æ
0.65
})
0.65
?",
0.64
})
0.64
")
0.63
Activations Density 2.101%