INDEX
Explanations
words related to political ideologies and affiliations
terms related to political affiliations and ideologies
New Auto-Interp
Negative Logits
uden
-0.70
ften
-0.69
smelled
-0.65
orah
-0.65
Carbuncle
-0.64
phe
-0.64
anwhile
-0.62
mma
-0.62
arij
-0.62
iggurat
-0.62
POSITIVE LOGITS
situations
0.90
backgrounds
0.85
positions
0.83
connections
0.82
pursuits
0.81
versions
0.80
scenarios
0.80
bodies
0.79
violations
0.79
associations
0.79
Activations Density 0.917%