INDEX
Explanations
last names ending in "ent" or "ent" itself
New Auto-Interp
Negative Logits
BILITIES
-0.78
STON
-0.75
issance
-0.68
STER
-0.65
iets
-0.64
BILITY
-0.63
side
-0.63
Heist
-0.63
Scythe
-0.62
CHR
-0.61
POSITIVE LOGITS
ucky
1.13
ral
1.12
ropy
1.12
inel
1.00
rification
0.96
ertain
0.95
acles
0.95
acion
0.94
reprene
0.94
imental
0.93
Activations Density 0.027%