Explanations
references to and mentions of "Biden."
mentions of Joe Biden
Negative Logits
igators     -0.73
iances      -0.70
ELF         -0.70
IENT        -0.68
rical       -0.67
Anarchy     -0.67
istance     -0.67
eanor       -0.66
orically    -0.66
Democr      -0.66
Positive Logits
Biden       1.03
ught        0.82
jug         0.82
zag         0.82
bent        0.81
hole        0.81
geon        0.73
Hath        0.73
ga          0.72
batch       0.70
Activations Density: 0.032%