INDEX
NEGATIVE LOGITS
oluştur       0.45
două          0.45
两个          0.44
三个          0.43
两种          0.42
клави         0.41
有两个        0.40
двумя         0.40
three         0.40
четыре        0.39
POSITIVE LOGITS
unscrupulous  0.55
discredit     0.50
defraud       0.48
exorbit       0.47
liquidate     0.47
outsiders     0.47
disgusting    0.46
falsehood     0.46
foreigners    0.45
scanty        0.45
Activations Density 0.000%