INDEX
Explanations
occurrences of the word "name" in various forms
New Auto-Interp
Negative Logits
yrinth
-0.89
psey
-0.75
EMS
-0.73
elaide
-0.73
irth
-0.71
isoft
-0.68
asio
-0.68
istar
-0.67
icult
-0.66
ĸļ
-0.66
POSITIVE LOGITS
plates
1.39
plate
1.22
paces
1.21
names
1.00
paced
0.97
brand
0.91
names
0.84
ames
0.83
aliases
0.83
calling
0.81
Activations Density 0.043%