INDEX
Explanations
multilingual text or foreign words
New Auto-Interp
Negative Logits
uestions
-1.28
смарт
-1.27
метров
-1.27
ducts
-1.25
⿷
-1.22
TWENTY
-1.19
ッチン
-1.16
menak
-1.16
okolade
-1.14
hırka
-1.14
POSITIVE LOGITS
just
1.42
our
1.32
и
1.32
of
1.31
OGSÅ
1.29
:
1.21
και
1.19
说完
1.19
قدیمی
1.17
și
1.16
Activations Density 0.149%