INDEX
Explanations
references to libertarianism
references to libertarianism and related political concepts
New Auto-Interp
Negative Logits
ams
-0.74
ibo
-0.71
liam
-0.71
release
-0.70
older
-0.69
Lakes
-0.69
tch
-0.68
NEY
-0.67
WAY
-0.66
Drawn
-0.66
POSITIVE LOGITS
tarians
1.20
tarian
1.20
Cato
0.92
libertarian
0.92
anarchism
0.90
libertarians
0.88
anarchist
0.85
welf
0.81
republic
0.80
communism
0.80
Activations Density 0.010%