Explanations
occurrences of the word "it" and variations of the verb "to be."
Negative Logits
Token     Logit
's        -0.18
ิà¸į       -0.16
´s        -0.15
`s        -0.15
endez     -0.15
enco      -0.14
'aut      -0.14
psc       -0.14
jadx      -0.14
’s        -0.14
Positive Logits
Token     Logit
trans     0.29
suff      0.21
seems     0.19
follows   0.19
seem      0.19
is        0.18
iner      0.18
should    0.18
appears   0.18
appe      0.18
Activations Density: 0.110%