INDEX
Explanations
references to human anatomy, specifically arms
references to weapons or the arms industry
Negative Logits
Woodward         -0.84
Dough            -0.77
ãĤ©              -0.72
TION             -0.70
ded              -0.70
advertisement    -0.68
IENCE            -0.66
Sound            -0.65
Vide             -0.65
gres             -0.64
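The token "ãĤ©" in the negative logits list looks like the byte-level spelling that GPT-2-style BPE vocabularies use for non-ASCII bytes. The page does not name the tokenizer, so the following is only a sketch under that assumption: it rebuilds the standard GPT-2 byte-to-character table and inverts it to recover the underlying text. The helper names (bytes_to_unicode, decode_byte_level_token) are illustrative, not part of the dashboard.

```python
def bytes_to_unicode() -> dict[int, str]:
    """Rebuild the GPT-2 byte -> printable-character table (assumed tokenizer)."""
    # Printable bytes map to themselves.
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    cs = bs[:]
    n = 0
    # Every remaining byte is shifted to a code point at or above U+0100.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, (chr(c) for c in cs)))


# Inverse table: displayed character -> raw byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_byte_level_token(token: str) -> str:
    """Map a byte-level token string back to the UTF-8 text it encodes."""
    raw = bytes(UNICODE_TO_BYTE[ch] for ch in token)
    # Tokens can end mid-character, so replace undecodable bytes instead of raising.
    return raw.decode("utf-8", errors="replace")


print(decode_byte_level_token("ãĤ©"))  # → ォ
```

If the assumption holds, the token is the katakana character ォ (bytes E3 82 A9) rather than encoding debris, which is why it is kept verbatim in the list.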
Positive Logits
ament            1.14
aments           0.94
arms             0.90
chair            0.89
ageddon          0.86
arm              0.81
illary           0.80
guards           0.79
bands            0.77
phabet           0.76
Activations Density 0.009%