INDEX
Negative Logits
Chosen
-0.07
rehab
-0.07
promoters
-0.07
обходимо
-0.07
완료
-0.07
rehabilitation
-0.07
avoidable
-0.07
.states
-0.07
위한
-0.07
arma
-0.07
POSITIVE LOGITS
straw
0.09
NCR
0.08
gaw
0.08
obi
0.08
anish
0.07
גבוה
0.07
ън
0.07
mnemonic
0.07
دقيقة
0.07
munch
0.07
Activations Density 0.000%