INDEX
Explanations
mentions of the word "ay" and its variations
New Auto-Interp
Negative Logits
sy
-0.21
د
-0.20
su
-0.19
s
-0.19
erer
-0.19
erator
-0.18
sell
-0.17
sch
-0.17
er
-0.16
sen
-0.16
POSITIVE LOGITS
urved
0.23
enne
0.23
yyyy
0.23
yyy
0.21
eur
0.20
den
0.20
YYYY
0.20
ton
0.20
eb
0.20
oncé
0.20
Activations Density 0.098%