INDEX
Explanations
expressions indicating action and interaction
New Auto-Interp
Negative Logits
ÑĩиÑģ
-0.15
etrofit
-0.15
ansom
-0.14
ktop
-0.14
toile
-0.14
alar
-0.13
Crest
-0.13
низ
-0.13
urent
-0.13
Ñİ
-0.13
POSITIVE LOGITS
ovich
0.15
anut
0.14
âĶ´
0.14
ubs
0.14
Ars
0.14
aptors
0.13
alist
0.13
ì¡°ìĤ¬
0.13
\Table
0.13
ela
0.13
Activations Density 2.606%