INDEX
Negative Logits
presumably
1.29
로
1.27
४
1.19
ная
1.19
ст
1.18
ik
1.17
PFA
1.13
і
1.12
ﮏ
1.12
其實
1.11
POSITIVE LOGITS
तमंद
2.17
chắn
2.02
footed
1.79
પણે
1.62
likle
1.49
tte
1.49
weten
1.47
ll
1.40
ty
1.39
glad
1.37
Activations Density 0.031%
presumably
로
४
ная
ст
ik
PFA
і
ﮏ
其實
तमंद
chắn
footed
પણે
likle
tte
weten
ll
ty
glad