Explanations
words related to political power
terms related to power dynamics and distribution
Negative Logits
riad      -0.71
Taste     -0.71
Von       -0.70
verett    -0.69
romeda    -0.69
eret      -0.67
ead       -0.67
TAG       -0.67
ogene     -0.67
ALK       -0.66
Positive Logits
levers    1.00
houses    0.98
vested    0.95
wielded   0.87
lessness  0.85
FUL       0.81
stroke    0.80
lifting   0.80
outage    0.79
vacuum    0.77
Activations Density: 0.037%