INDEX
Explanations
mentions of political figures, particularly senators
references to senators
New Auto-Interp
Negative Logits
manual
-0.70
Siberian
-0.67
ponies
-0.66
oper
-0.63
footed
-0.63
unmarked
-0.62
managerial
-0.61
LTD
-0.61
hypers
-0.60
civilisation
-0.59
POSITIVE LOGITS
iors
1.35
eca
1.05
pai
1.04
escent
1.02
esse
0.98
egal
0.96
olt
0.93
ority
0.90
ileaks
0.90
uary
0.89
Activations Density 0.011%