INDEX
Negative Logits
and
-0.66
providedIn
-0.66
批
-0.60
gewor
-0.59
şun
-0.56
awaiter
-0.56
iaitu
-0.54
そして
-0.52
and
-0.52
"]();
-0.52
POSITIVE LOGITS
if
0.61
it
0.59
when
0.59
by
0.55
the
0.55
in
0.53
during
0.52
anytime
0.52
whenever
0.52
you
0.51
Activations Density 0.015%