INDEX
Explanations
proper nouns related to politics and government actions
New Auto-Interp
Negative Logits
yrinth
-0.92
cffffcc
-0.82
Downloadha
-0.80
ITAL
-0.77
interstitial
-0.76
ategory
-0.75
itual
-0.75
REM
-0.74
awaru
-0.72
ENA
-0.71
POSITIVE LOGITS
supremacist
1.30
supremacists
1.21
house
1.08
supremacy
1.04
beard
1.02
bread
1.00
caps
0.97
hall
0.95
Sox
0.94
nationalist
0.94
Activations Density 0.316%