INDEX
Explanations
names, likely of people or places
occurrences of the substring "lo" in words
New Auto-Interp
Negative Logits
tremend
-0.74
krit
-0.69
indebted
-0.63
glim
-0.61
Bethesda
-0.60
principals
-0.60
agric
-0.59
wee
-0.56
transient
-0.56
curfew
-0.56
POSITIVE LOGITS
onne
0.92
odor
0.86
ohyd
0.85
ãĤ¼ãĤ¦ãĤ¹
0.84
ades
0.79
reau
0.77
anwhile
0.77
anne
0.74
ucci
0.73
ello
0.73
Activations Density 0.081%