INDEX
Negative Logits
It
-2.78
2
-2.33
They
-2.31
after
-2.31
This
-2.19
After
-2.16
There
-2.14
With
-1.97
If
-1.95
You
-1.94
POSITIVE LOGITS
筇
2.08
zape
1.87
least
1.84
který
1.82
閦
1.80
distintas
1.79
cambian
1.79
самая
1.78
ofthe
1.78
1.77
Activations Density 0.013%