INDEX
Negative Logits
to
-1.58
declaró
-1.50
коля
-1.48
6
-1.46
confirmó
-1.45
кож
-1.42
reveló
-1.41
but
-1.41
之意
-1.41
menyadari
-1.40
POSITIVE LOGITS
borracha
1.83
我们
1.66
ralla
1.62
their
1.57
two
1.53
her
1.50
one
1.49
him
1.48
three
1.48
seven
1.47
Activations Density 0.008%