INDEX
Explanations
names of people in a sports context
New Auto-Interp
Negative Logits
glers
-0.82
WAYS
-0.74
ties
-0.65
Indigo
-0.64
Activity
-0.63
lihood
-0.62
meal
-0.61
Pull
-0.60
Fahrenheit
-0.59
waters
-0.59
POSITIVE LOGITS
olition
1.32
eanor
1.27
agogue
1.26
ocrat
1.18
psey
1.09
ographics
1.09
agog
1.04
aterial
1.02
otion
1.01
otic
0.94
Activations Density 0.014%