INDEX
Negative Logits
Somehow
0.89
ことにより
0.86
njemu
0.85
jego
0.84
които
0.84
so
0.83
conseguenza
0.83
považ
0.82
edilmesi
0.82
jotka
0.82
POSITIVE LOGITS
though
1.39
some
1.13
caveats
1.12
upfront
1.09
several
1.08
while
1.05
:
1.05
though
0.98
two
0.98
cybersecurity
0.97
Activations Density 0.026%