Negative Logits
німа    0.68
龔    0.67
Dhabi    0.63
atis    0.63
క్త    0.62
щее    0.61
瑚    0.61
न्तु    0.59
Nair    0.59
bellum    0.58
Positive Logits
berd    0.66
blown    0.66
नियु    0.62
ant    0.61
ори    0.61
vstack    0.58
aisen    0.58
Bowen    0.58
Trif    0.58
hard    0.58
Activations Density 0.113%