INDEX
Explanations
themes related to punishment and revenge
New Auto-Interp
Negative Logits
ÏĢλα
-0.14
iscard
-0.14
.opend
-0.13
æĥij
-0.13
Alarm
-0.13
íĹĮ
-0.13
Alarm
-0.13
epam
-0.13
ayd
-0.12
coma
-0.12
POSITIVE LOGITS
revenge
0.63
vengeance
0.61
Revenge
0.49
venge
0.48
retaliation
0.47
retali
0.41
retal
0.41
vend
0.40
pay
0.37
justice
0.36
Activations Density 0.136%