INDEX
Negative Logits
rejected
-0.07
spacing
-0.07
spent
-0.06
Thin
-0.06
thoải
-0.06
Trafford
-0.06
_PENDING
-0.06
.DialogInterface
-0.06
_td
-0.06
potassium
-0.06
POSITIVE LOGITS
alte
0.06
наблюд
0.06
...,
0.06
accommodations
0.06
зависим
0.06
Exports
0.06
,vector
0.06
entario
0.06
list
0.06
몰
0.06
Activations Density 0.001%