INDEX
Explanations
references to entities or events related to Nazi Germany
mentions of the Nazi Party and its associated historical references
New Auto-Interp
Negative Logits
Asia
-0.76
Dub
-0.74
20439
-0.71
tis
-0.69
ature
-0.68
TOR
-0.68
Interstitial
-0.66
AQ
-0.66
ately
-0.65
ables
-0.65
POSITIVE LOGITS
ocaust
1.14
chwitz
1.12
extermination
1.02
utsche
0.97
Holocaust
0.96
Germany
0.95
swast
0.95
Hitler
0.94
salute
0.92
Reich
0.91
Activations Density 0.079%