INDEX
Negative Logits
obviously
0.45
pins
0.44
absur
0.43
passer
0.42
0.41
pauses
0.40
transport
0.40
amour
0.39
dun
0.39
insignia
0.39
POSITIVE LOGITS
વપરા
0.50
општи
0.49
使用的
0.46
కథ
0.45
जेटली
0.45
ниципа
0.44
Sử
0.44
叅
0.44
সাধারণ
0.44
ાય
0.43
Activations Density 0.000%