Explanations
references to various forms of violence or weaponry
Negative Logits
فريبيس        -0.56
épaules       -0.53
lèvres        -0.52
uxxxx         -0.49
паспорт       -0.49
Skirt         -0.49
GEBURTSDATUM  -0.49
gown          -0.48
pacemaker     -0.48
sweatshirt    -0.48
Positive Logits
umbrellas  0.81
Bibles     0.79
pens       0.75
knives     0.75
swords     0.74
backpacks  0.73
guitars    0.73
shovels    0.72
bottles    0.72
chairs     0.71
Activations Density 0.741%