INDEX
Explanations
names of prominent political figures and their connections to governmental actions
mentions of people's names
New Auto-Interp
Negative Logits
eric
-0.81
nu
-0.67
Folk
-0.67
bler
-0.64
hend
-0.63
gery
-0.63
fields
-0.63
ISBN
-0.63
naires
-0.62
Conclusion
-0.62
POSITIVE LOGITS
CLOSE
0.71
ornings
0.69
asse
0.68
avanaugh
0.67
Rodham
0.66
reacts
0.66
vernight
0.63
halting
0.62
esson
0.61
ayson
0.61
Activations Density 0.067%