INDEX
Explanations
mentions of the name "Diane Rodham" in the text
New Auto-Interp
Negative Logits
ters
-0.77
PDATE
-0.73
mare
-0.72
aic
-0.70
KT
-0.68
fect
-0.67
bidden
-0.67
tered
-0.67
eu
-0.64
ELL
-0.64
POSITIVE LOGITS
Rodham
1.16
Clinton
1.10
clinton
0.96
Clintons
0.89
Abedin
0.84
Clinton
0.81
INTON
0.79
umenthal
0.76
aide
0.75
velt
0.74
Activations Density 0.023%