INDEX
Explanations
names or identifiers of individuals or entities
occurrences of the word "names."
New Auto-Interp
Negative Logits
OPLE
-0.75
Bed
-0.67
UTERS
-0.66
yrinth
-0.65
Bulletin
-0.65
irth
-0.64
Springs
-0.63
UGE
-0.62
IENT
-0.61
Returns
-0.60
POSITIVE LOGITS
paces
1.66
pace
1.20
paced
1.09
plates
1.06
hips
1.03
mith
0.97
peed
0.96
erver
0.94
ames
0.94
aliases
0.89
Activations Density 0.027%