INDEX
Explanations
words related to political ideologies, particularly the political right
references to the political right
New Auto-Interp
Negative Logits
ĸļ
-0.80
ADRA
-0.71
ulative
-0.71
Continuous
-0.70
cit
-0.68
Zot
-0.66
Specifications
-0.66
Remastered
-0.66
Seasons
-0.65
DAQ
-0.62
POSITIVE LOGITS
wing
1.20
eous
1.07
wing
1.07
ward
1.04
move
0.88
winger
0.86
flank
0.84
lander
0.83
shore
0.72
Wing
0.70
Activations Density 0.040%