INDEX
Explanations
names, likely focusing on first names
names of people, particularly those involved in notable events or contexts
New Auto-Interp
Negative Logits
iance
-0.89
itect
-0.88
aic
-0.87
ibel
-0.77
iers
-0.77
inar
-0.74
yrinth
-0.74
oÄŁ
-0.74
efficient
-0.73
ectar
-0.72
POSITIVE LOGITS
Rogers
1.02
Bobby
0.99
Doodle
0.89
Willie
0.84
Benny
0.81
Burns
0.79
Pett
0.79
Tyson
0.79
Horton
0.79
Hank
0.78
Activations Density 0.047%