Explanations
occurrences of the word "speaks" or its variations
Negative Logits
Token       Logit
baugh       -0.16
ään         -0.15
ourn        -0.15
ouz         -0.14
ogui        -0.14
ÃŃna        -0.14
ouri        -0.14
rosso       -0.14
Everyday    -0.14
unan        -0.14
Positive Logits
Token       Logit
zial        0.27
ake         0.27
aks         0.26
akers       0.25
acial       0.25
arm         0.24
ical        0.24
icher       0.24
ck          0.23
aking       0.23
Activations Density: 0.007%