INDEX
Negative Logits
億
-0.09
-0.08
Term
-0.08
stre
-0.07
termos
-0.07
હર
-0.07
永
-0.07
esigen
-0.07
_per
-0.07
遠
-0.07
POSITIVE LOGITS
Though
0.12
That
0.11
Though
0.11
That
0.11
though
0.11
There
0.11
Although
0.10
Thus
0.10
Thus
0.10
þeir
0.10
Activations Density 0.003%