INDEX
Explanations
words that include the syllable "ile."
New Auto-Interp
Negative Logits
re
-0.21
ingly
-0.20
reb
-0.19
ings
-0.18
res
-0.18
rie
-0.18
riel
-0.18
n
-0.18
roe
-0.17
die
-0.17
POSITIVE LOGITS
brities
0.26
urope
0.24
arning
0.21
psy
0.21
ighton
0.20
aders
0.20
phant
0.20
opard
0.20
ilgili
0.20
phants
0.19
Activations Density 0.093%