INDEX
Explanations
references to political affiliations, particularly "R" for Republican and "D" for Democrat
Negative Logits
Token            Value
supplies         -0.67
disclaim         -0.65
compilation      -0.64
gratification    -0.63
Mandela          -0.63
Guinness         -0.63
transports       -0.62
variables        -0.62
exploits         -0.62
tutorials        -0.61
Positive Logits
Token            Value
ICH              0.97
appa             0.93
ollo             0.93
aska             0.92
)'               0.91
iffe             0.90
NJ               0.90
UST              0.86
ump              0.85
achus            0.85
Activations Density: 0.018%