INDEX
Negative Logits
parar
-0.09
yaxşı
-0.08
Rhine
-0.08
ahụ
-0.08
baths
-0.07
rind
-0.07
bandh
-0.07
të
-0.07
Voy
-0.07
Ott
-0.07
POSITIVE LOGITS
boost
0.07
_report
0.07
condição
0.07
(dep
0.07
CONDITION
0.07
liable
0.07
secure
0.07
reporting
0.07
olfo
0.07
时候
0.07
Activations Density 0.003%