INDEX
Explanations
instances of the word "to" and its various forms in the context of potential actions or states
New Auto-Interp
Negative Logits
Beam
-0.19
Beat
-0.16
Beam
-0.16
Beat
-0.15
ibu
-0.14
nám
-0.13
kaar
-0.13
olumn
-0.13
.scalablytyped
-0.13
ÑıÑģÑĮ
-0.13
POSITIVE LOGITS
be
1.18
be
0.63
Be
0.56
باشد
0.51
be
0.48
Be
0.47
_be
0.45
.be
0.45
seja
0.43
бÑĭÑĤÑĮ
0.41
Activations Density 0.396%