INDEX
Negative Logits
mitral
0.43
जीलैंड
0.42
birdseye
0.41
malef
0.41
dailySales
0.39
centrif
0.39
بیماری
0.39
frauen
0.38
护理
0.38
analisi
0.38
POSITIVE LOGITS
бята
0.36
Talk
0.33
ու
0.33
एनजी
0.32
스스로
0.32
Talk
0.32
бесе
0.32
LGBT
0.31
Chat
0.31
mixture
0.31
Activations Density 0.005%