INDEX
Explanations
mentions of political labels like 'liberals' and 'conservatives'
references to liberals and their associated concepts or values
Negative Logits
Delivery             -0.70
inventoryQuantity    -0.67
submission           -0.64
increments           -0.61
circumstances        -0.59
wise                 -0.58
Owner                -0.58
Account              -0.58
domain               -0.58
Territories          -0.57
Positive Logits
ervatives    1.36
aurus        1.32
paces        1.32
ervative     1.30
hip          1.06
chool        1.02
mith         1.02
hips         1.00
ongs         0.92
heet         0.92
Activations Density 0.082%