INDEX
Explanations
the word "inn" with different levels of emphasis
occurrences of the substring "inn" within words
New Auto-Interp
Negative Logits
ngth
-0.75
gencies
-0.70
xual
-0.68
ffen
-0.65
¶æ
-0.64
lda
-0.63
ques
-0.63
©¶æ
-0.62
litter
-0.62
reon
-0.61
POSITIVE LOGITS
ipeg
1.51
ovation
1.20
spir
1.12
ikuman
1.12
osuke
1.01
igan
0.96
ocent
0.94
iband
0.93
ings
0.91
umerable
0.90
Activations Density 0.017%