INDEX
Negative Logits
to
-1.95
anunciado
-1.84
the
-1.80
eenvoudig
-1.76
with
-1.75
Then
-1.72
explicado
-1.70
珦
-1.68
in
-1.68
与其
-1.60
POSITIVE LOGITS
8
1.83
4
1.80
6
1.61
所有的
1.61
This
1.58
9
1.56
Our
1.53
2
1.51
7
1.50
They
1.49
Activations Density 0.003%