INDEX
Negative Logits
नक्की
0.62
द्रा
0.61
welcome
0.57
امة
0.57
+
0.56
accarat
0.56
+
0.55
define
0.55
defines
0.55
definitely
0.55
POSITIVE LOGITS
reasons
1.26
razones
1.25
出于
1.16
Reasons
1.13
alasan
1.09
种种
1.05
raisons
1.04
expediency
1.04
Reasons
1.04
担心
1.03
Activations Density 0.433%