INDEX
Negative Logits
ہ
1.23
тился
1.05
приобрета
1.05
choisi
1.03
هُ
1.02
하였
1.02
𝗶
1.02
1.02
Freude
1.01
Chakraborty
1.01
POSITIVE LOGITS
harassing
0.93
takes
0.88
takes
0.88
breaks
0.87
suffers
0.87
Unusual
0.86
hasn
0.84
hates
0.84
pulls
0.83
Policing
0.83
Activations Density 0.000%