INDEX
Negative Logits
first
-1.98
for
-1.88
其余
-1.77
gesund
-1.68
only
-1.63
甞
-1.60
by
-1.56
where
-1.56
in
-1.55
الأولى
-1.55
POSITIVE LOGITS
flera
1.89
nästan
1.78
k
1.77
ቼ
1.77
鲼
1.77
naturligt
1.76
ignment
1.76
-\\
1.75
möjlighet
1.72
oting
1.68
Activations Density 0.079%