INDEX
Explanations
words related to acts of violence or killing
occurrences of the word "slaughter."
New Auto-Interp
Negative Logits
Ronaldo
-0.64
Ric
-0.62
Richards
-0.61
Crom
-0.61
Princ
-0.60
Sovereign
-0.60
Scot
-0.60
âĵĺ
-0.59
retention
-0.59
aucas
-0.59
POSITIVE LOGITS
Slaughter
1.17
slaughter
1.15
houses
1.08
\\\\\\\\
1.01
edIn
0.93
ãĥ¼ãĤ¯
0.92
house
0.91
quished
0.90
fest
0.88
slaughtered
0.84
Activations Density 0.012%