INDEX
Negative Logits
to
-1.26
their
-0.99
exercí
-0.98
true
-0.96
tml
-0.95
erweise
-0.94
such
-0.92
have
-0.91
自作
-0.90
other
-0.90
POSITIVE LOGITS
vocations
1.16
familières
1.11
zwischen
1.07
gånger
1.05
ÁN
1.05
orrho
1.04
mtable
1.01
Кто
1.00
……”
1.00
котором
1.00
Activations Density 0.124%