INDEX
Explanations
references to Nazi-related terms and concepts
references to the Nazi regime and its associated historical context
New Auto-Interp
Negative Logits
pole
-0.78
tis
-0.74
Interstitial
-0.74
pring
-0.73
20439
-0.72
Dub
-0.72
notes
-0.71
player
-0.68
area
-0.67
forward
-0.67
POSITIVE LOGITS
Hitler
0.98
ocaust
0.88
Germany
0.87
chwitz
0.86
Youth
0.85
salute
0.84
Holocaust
0.84
wald
0.80
swast
0.79
Nazi
0.79
Activations Density 0.058%