INDEX
Negative Logits
0.61
\
0.59
you
0.58
5
0.56
you
0.50
ن
0.48
use
0.48
1
0.48
~
0.48
mu
0.47
POSITIVE LOGITS
ได้อย่าง
0.52
óż
0.49
cậu
0.48
După
0.48
🕜
0.47
捚
0.47
πάντα
0.47
❌
0.47
Однако
0.46
întreb
0.46
Activations Density 0.003%
\
you
5
you
ن
use
1
~
mu
ได้อย่าง
óż
cậu
După
🕜
捚
πάντα
❌
Однако
întreb