INDEX
Explanations
proper nouns, specifically names of places or people
references to specific individuals or notable figures
New Auto-Interp
Negative Logits
aciously
-0.86
bsp
-0.76
Gutenberg
-0.75
uous
-0.71
EMP
-0.69
information
-0.68
debtor
-0.68
doct
-0.67
unction
-0.66
ifier
-0.65
POSITIVE LOGITS
Carroll
1.09
sburg
0.90
mont
0.88
ton
0.86
Gardens
0.84
lean
0.79
Shelby
0.79
oqu
0.78
dale
0.75
anut
0.75
Activations Density 0.010%