INDEX
Explanations
names that end with "son"
occurrences of the word "son."
New Auto-Interp
Negative Logits
heels
-0.71
odies
-0.69
deficits
-0.66
spread
-0.66
acebook
-0.62
ANN
-0.61
ASED
-0.61
contagious
-0.60
chronically
-0.60
arded
-0.60
POSITIVE LOGITS
nen
0.92
strom
0.89
son
0.88
hood
0.87
ville
0.84
nets
0.84
ning
0.83
ja
0.81
stown
0.79
ryu
0.79
Activations Density 0.017%