INDEX
Explanations
the frequent use of the word "just."
New Auto-Interp
Negative Logits
exactly
-0.18
only
-0.16
idget
-0.16
právÄĽ
-0.15
orsi
-0.15
именно
-0.14
such
-0.14
ONLY
-0.14
no
-0.14
rete
-0.14
POSITIVE LOGITS
ifiable
0.18
ifying
0.18
uxtap
0.17
iban
0.15
ä¿Ĥ
0.15
born
0.15
_called
0.15
ifications
0.15
æģĴ
0.14
ified
0.14
Activations Density 0.091%