INDEX
Explanations
expressions of personal opinion and emphasis in conversations
New Auto-Interp
Negative Logits
-0.16
,
-0.16
gate
-0.16
.
-0.15
esh
-0.15
zin
-0.15
-
-0.15
Pro
-0.14
-
-0.14
Gall
-0.14
POSITIVE LOGITS
å·§
0.16
å¡ļ
0.16
endir
0.16
_trampoline
0.16
acker
0.16
RowAt
0.15
ught
0.15
(disposing
0.15
/epl
0.15
uede
0.15
Activations Density 0.139%