INDEX
NEGATIVE LOGITS
was         1.31
yourself    1.14
Was         1.14
Was         1.06
your        1.04
WAS         1.02
was         1.01
meus        0.98
cor         0.97
my          0.96
POSITIVE LOGITS
themselves  2.41
他们的      2.37
他們的      2.36
Their       2.32
their       2.31
ihre        2.15
deres       2.14
their       2.12
纷纷        2.10
தங்கள்      2.09
Activations Density: 1.026%