INDEX
NEGATIVE LOGITS
ſever            -1.80
différents       -1.73
diſt             -1.67
различных        -1.59
嵓               -1.55
ſmall            -1.53
verschillende    -1.52
unſ              -1.51
leſs             -1.51
choisir          -1.48
POSITIVE LOGITS
which            2.06
(                1.95
with             1.83
that             1.77
on               1.56
two              1.48
will             1.48
this             1.45
from             1.40
one              1.38
Activations Density: 0.024%