INDEX
Explanations
the demonstrative word "this" in various contexts
New Auto-Interp
Negative Logits
zent
-0.20
ollo
-0.19
iesel
-0.17
-fw
-0.16
aina
-0.16
Cent
-0.15
-:-
-0.15
enie
-0.15
èĭ¹æŀľ
-0.15
iona
-0.15
POSITIVE LOGITS
ordinate
0.16
/if
0.15
ESL
0.14
Tat
0.14
mes
0.14
options
0.14
ieder
0.13
.BLL
0.13
ACL
0.13
šti
0.13
Activations Density 0.135%