INDEX
Explanations
terms related to liberal ideologies
references to liberal ideology and associated concepts
New Auto-Interp
Negative Logits
iru
-0.76
pty
-0.68
atoon
-0.67
BLE
-0.66
breaker
-0.66
angan
-0.66
Danger
-0.65
Blazing
-0.65
atel
-0.64
abba
-0.63
POSITIVE LOGITS
ization
1.27
izing
1.17
izers
1.13
arts
1.08
ized
1.07
izations
1.04
izes
1.01
isation
1.00
izer
0.99
ize
0.93
Activations Density 0.024%