INDEX
NEGATIVE LOGITS
Token    Logit
alas     0.74
allic    0.66
unab     0.64
ыл       0.64
ך        0.62
nels     0.61
alls     0.61
यन       0.60
bring    0.60
als      0.59
POSITIVE LOGITS
Token         Logit
what          1.50
What          1.46
What          1.37
what          1.31
different     1.13
Different     1.11
diferentes    1.10
different     1.07
Different     1.05
WHAT          1.00
Activations Density: 0.233%