INDEX
Explanations
names of famous individuals
names of notable individuals and references to performance or reputation
New Auto-Interp
Negative Logits
obo
-0.71
istically
-0.70
ño
-0.66
iru
-0.66
ASH
-0.65
iso
-0.64
gdala
-0.62
Seg
-0.60
emia
-0.60
llah
-0.60
POSITIVE LOGITS
Phelps
0.95
Manson
0.89
icka
0.87
mans
0.82
achusetts
0.79
mann
0.79
liga
0.78
ter
0.77
itudes
0.77
enium
0.76
Activations Density 0.024%