INDEX
Explanations
the name "Giffords"
the presence of the name "Giffords."
Negative Logits
ctica        -0.66
igator       -0.65
rily         -0.64
Journalism   -0.64
thia         -0.64
qqa          -0.64
zech         -0.63
GREEN        -0.61
atform       -0.61
ACTED        -0.61
Positive Logits
iculty       1.13
erence       0.91
erences      0.87
ield         0.86
erey         0.86
doms         0.85
ness         0.83
rence        0.82
icult        0.81
enegger      0.80
Activations Density 0.025%