Explanations
names related to historical figures, particularly focusing on Abraham Lincoln
mentions of the name "Abraham."
Negative Logits
Token       Logit
nces        -0.85
prus        -0.79
Flavoring   -0.75
ttes        -0.72
essee       -0.72
HMS         -0.71
lla         -0.69
hift        -0.68
eq          -0.67
TOP         -0.67
Positive Logits
Token       Logit
Lincoln     1.01
shire       0.98
raham       0.97
sson        0.96
son         0.93
Abraham     0.88
sen         0.85
sburg       0.84
lisher      0.78
ode         0.77
Activations Density: 0.010%
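
The density figure can be read as the share of tokens on which the feature fires. The sketch below illustrates that reading only. It assumes "density" means the fraction of tokens whose activation exceeds a threshold (zero here); the dashboard does not state its exact definition, and the function name, the threshold parameter, and the sample data are hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: density = (number of active tokens) / (total tokens).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: the feature fires on 1 token out of 10,000.
sample = [0.0] * 9999 + [3.2]
density = activation_density(sample)
print(f"{density:.3%}")  # prints "0.010%"
```

Under this reading, a density of 0.010% corresponds to roughly one active token per ten thousand, which fits a feature tied to a narrow set of name tokens.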