INDEX
Explanations
mentions of political figures from different states and their actions or statements
abbreviations and designators related to political representation
Negative Logits
compilation    -0.67
managers       -0.66
smugglers      -0.63
dehuman        -0.62
recomp         -0.62
Mandela        -0.61
traffickers    -0.60
referees       -0.60
policing       -0.59
Wenger         -0.58
Positive Logits
)'             1.03
.),            0.90
veland         0.89
.,             0.89
inois          0.84
)              0.84
Republican     0.82
),             0.80
achusetts      0.80
appa           0.80
Activations Density: 0.032%