INDEX
Explanations
occurrences of the preposition "to."
New Auto-Interp
Negative Logits
izr
-0.16
immers
-0.15
xuyên
-0.15
otty
-0.14
öy
-0.14
icana
-0.14
/we
-0.14
íĢ
-0.14
-Sah
-0.14
_THREAD
-0.14
POSITIVE LOGITS
iker
0.16
alim
0.15
-the
0.15
intrinsic
0.15
ef
0.14
ycz
0.14
Anders
0.14
Percy
0.13
mage
0.13
isin
0.13
Activations Density 0.407%