INDEX
Explanations
terms associated with violence and aggression
New Auto-Interp
Negative Logits
NOPQRST
-0.95
Dek
-0.70
sembler
-0.69
مرئيه
-0.69
fermés
-0.68
méri
-0.67
fjspx
-0.67
profondeur
-0.66
følgelig
-0.66
луч
-0.66
POSITIVE LOGITS
violence
1.92
Violence
1.74
violent
1.69
violence
1.62
Violence
1.61
violen
1.58
Violent
1.58
Violent
1.58
violent
1.52
violento
1.32
Activations Density 0.088%