INDEX
Explanations
references to the name "Nancy."
New Auto-Interp
Negative Logits
points
-0.78
=-=-=-=-
-0.73
=-=-
-0.73
upon
-0.73
orically
-0.72
tered
-0.72
angers
-0.71
Redditor
-0.70
ickr
-0.70
guiActiveUnfocused
-0.70
POSITIVE LOGITS
Pelosi
1.38
Reagan
0.92
Drew
0.92
Grace
0.89
Lan
0.83
Ker
0.79
Lou
0.79
vier
0.77
Mae
0.77
Child
0.76
Activations Density 0.004%