INDEX
Explanations
references to libertarianism and related political ideologies
New Auto-Interp
Negative Logits
aney
-0.16
stadt
-0.16
uran
-0.15
GNUC
-0.15
sdale
-0.14
ÇIJ
-0.14
agli
-0.14
baugh
-0.14
vrch
-0.14
burgh
-0.14
POSITIVE LOGITS
Ñģклад
0.16
isto
0.15
overl
0.14
arel
0.14
paralle
0.14
imals
0.14
-UA
0.14
alcon
0.13
(EC
0.13
spr
0.13
Activations Density 0.030%