INDEX
Explanations
punctuation marks and sentence endings
New Auto-Interp
Negative Logits
ocab
-0.17
azzi
-0.16
iesz
-0.16
orthand
-0.16
VICE
-0.16
ÐĬ
-0.15
полез
-0.14
752
-0.14
ITU
-0.14
ogany
-0.14
POSITIVE LOGITS
axe
0.16
borders
0.15
kus
0.14
è͵
0.14
relative
0.14
amas
0.14
kaz
0.14
ans
0.14
=format
0.14
BST
0.13
Activations Density 0.000%