INDEX
Explanations
references to the historical figure Abraham Lincoln
mentions of the name Abraham, specifically in historical contexts
New Auto-Interp
Negative Logits
nces
-0.80
prus
-0.79
essee
-0.78
TOP
-0.69
agra
-0.69
ttes
-0.67
hift
-0.67
omething
-0.66
HMS
-0.65
lla
-0.64
POSITIVE LOGITS
raham
1.08
son
1.08
sen
1.04
Lincoln
0.99
sson
0.95
shire
0.88
elsen
0.82
sburg
0.80
antine
0.80
iak
0.78
Activations Density 0.039%