INDEX
Explanations
words related to political ideology
instances of the word "liberal" and related concepts
New Auto-Interp
Negative Logits
Downloadha
-0.81
BuyableInstoreAndOnline
-0.78
pty
-0.74
asso
-0.69
Blazing
-0.68
Clever
-0.68
yang
-0.67
Sharp
-0.66
staking
-0.66
AUT
-0.65
POSITIVE LOGITS
ization
1.04
arts
1.00
izing
0.98
ized
0.94
izers
0.93
enclave
0.89
izes
0.88
bloc
0.87
ize
0.86
izer
0.85
Activations Density 0.022%