INDEX
Explanations
references to historical figures related to war and political ideology, specifically focusing on Adolf Hitler
references to Adolf Hitler and related discussions
New Auto-Interp
Negative Logits
SHARE
-0.71
tis
-0.71
Self
-0.71
Dub
-0.70
Sleep
-0.69
Asia
-0.68
Sources
-0.68
Pac
-0.68
Medium
-0.68
ilver
-0.68
POSITIVE LOGITS
Hitler
1.07
enstein
0.90
dinand
0.90
olini
0.88
geist
0.85
salute
0.84
oleon
0.84
abad
0.82
umenthal
0.81
indoctr
0.79
Activations Density 0.015%