INDEX
NEGATIVE LOGITS
detach      -0.07
besie       -0.07
taller      -0.07
свои        -0.06
renk        -0.06
duvar       -0.06
restrial    -0.06
evident     -0.06
oxide       -0.06
url         -0.06
POSITIVE LOGITS
helping     0.14
help        0.12
helped      0.11
Helping     0.10
helps       0.10
Help        0.09
(.)         0.08
help        0.08
ERVICE      0.08
:P          0.07
Activations Density 0.088%