INDEX
Negative Logits
beneficiaries
0.52
incriminating
0.47
substantiated
0.46
ய்
0.45
agonists
0.43
주
0.43
repercussions
0.42
orifices
0.42
論
0.42
endgame
0.41
POSITIVE LOGITS
tekst
0.50
Siamo
0.50
Timo
0.49
skapa
0.48
Lana
0.47
al
0.47
थोड़ी
0.46
profesora
0.46
ed
0.46
Arts
0.45
Activations Density 0.005%