INDEX
Explanations
names of individuals or fictional characters
words related to scientific or biological classifications, particularly names of species or genera
Negative Logits
Token        Logit
ishers       -0.82
sheet        -0.82
ry           -0.75
ered         -0.75
ried         -0.75
istically    -0.74
ership       -0.73
sie          -0.72
erick        -0.71
ishly        -0.71
Positive Logits
Token        Logit
aurus        1.21
ocial        1.01
hift         0.97
CRIP         0.96
ource        0.94
peed         0.93
ilon         0.88
henko        0.87
ktop         0.87
cale         0.85
Activations Density: 0.107%
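
The density figure is commonly read as the fraction of token positions on which the feature fires. The page itself does not state how it was computed, so the sketch below is only an illustration under that assumption: the function name `activation_density`, the zero threshold, and the sample activations are all hypothetical and do not come from this dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens on which the feature is
    active. The dashboard does not confirm this definition.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical activations for one feature over eight token positions.
sample = [0.0, 0.0, 1.3, 0.0, 0.0, 0.0, 0.0, 0.4]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.107% would mean the feature is active on roughly one token in every thousand.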