INDEX
Explanations
words related to punishment and physical discomfort
references to conflict and contentious themes, particularly involving the concept of "Jihad" and issues related to control and compliance
New Auto-Interp
Negative Logits
ocating
-0.69
atar
-0.65
eyed
-0.64
lag
-0.64
anca
-0.63
ischer
-0.62
dead
-0.61
orman
-0.61
vm
-0.60
cffffcc
-0.60
POSITIVE LOGITS
PRESS
2.74
Jihad
1.69
LET
1.57
pun
1.26
rugged
1.25
ersion
1.15
conform
1.13
Spread
1.10
Style
1.04
plet
0.97
Activations Density 0.051%