INDEX
Negative Logits
in
-2.50
more
-2.45
make
-2.42
epitom
-2.36
just
-2.33
With
-2.31
on
-2.28
unrelenting
-2.28
oddly
-2.25
at
-2.23
POSITIVE LOGITS
frança
2.39
為に
2.31
2.27
verschillende
2.22
2.20
Dinas
2.16
᛬
2.16
だけではなく
2.13
ܤ
2.11
horrid
2.08
Activations Density 0.001%