INDEX
Explanations
instances of emphasis or repetition related to countable objects or actions
New Auto-Interp
Negative Logits
аза
-0.18
ysa
-0.15
zu
-0.15
udu
-0.14
Chew
-0.14
nett
-0.14
ownload
-0.14
Sokol
-0.14
curity
-0.13
net
-0.13
POSITIVE LOGITS
itial
0.17
ião
0.14
lak
0.14
eru
0.13
elman
0.13
tôn
0.13
iring
0.13
åĽŀæĿ¥
0.13
ÙĦاÙĨ
0.13
FI
0.13
Activations Density 0.043%