INDEX
Explanations
expressions related to emotional distress or conflict
preceding negative outcomes
kill, humiliation, insult, revenge
New Auto-Interp
Negative Logits
таратура
-0.60
enumii
-0.55
obod
-0.54
IsMutable
-0.52
brevemente
-0.51
ThroughAttribute
-0.51
quently
-0.50
Meanwhile
-0.50
"..\..\..\
-0.50
}{@-0.49
POSITIVE LOGITS
出版年
0.82
humiliation
0.77
blackmail
0.75
insult
0.71
تعدى
0.69
shameless
0.68
revenge
0.67
humiliating
0.66
insults
0.66
kill
0.65
Activations Density 0.052%