INDEX
Negative Logits
اكثر
0.36
ренного
0.36
میرے
0.36
ponemos
0.35
ఫీ
0.35
సో
0.35
cowboys
0.35
వె
0.34
이걸
0.34
lessen
0.34
POSITIVE LOGITS
which
0.40
which
0.35
whose
0.35
although
0.34
described
0.33
<sup>
0.33
published
0.33
Gómez
0.33
García
0.32
Davis
0.32
Activations Density 0.025%