INDEX
Negative Logits
>("-0.08
_except
-0.08
captura
-0.08
spod
-0.08
blossoms
-0.08
encode
-0.07
intercept
-0.07
ikus
-0.07
Capture
-0.07
stre
-0.07
POSITIVE LOGITS
obses
0.11
zarar
0.10
dangerously
0.10
riesgos
0.10
hurried
0.10
dangers
0.10
risky
0.10
harmful
0.09
gesundheit
0.09
belast
0.09
Activations Density 0.053%