INDEX
Explanations
names of individuals, especially related to sports and entertainment
names of individuals in the context of entertainment and sports
Negative Logits
onyms     -0.83
lihood    -0.78
abases    -0.72
yre       -0.69
elist     -0.69
ukemia    -0.69
gered     -0.68
ERAL      -0.68
liest     -0.67
roleum    -0.67
Positive Logits
Abrams    0.85
feld      0.78
ynski     0.75
hower     0.73
factor    0.67
="#       0.67
Gunn      0.66
inoc      0.66
icum      0.65
mount     0.64
Activations Density 0.012%