INDEX
Negative Logits
were          -0.07
дод           -0.07
fortified     -0.07
_union        -0.06
пы            -0.06
(face         -0.06
surgeries     -0.06
commonplace   -0.06
_amp          -0.06
Sup           -0.06
Positive Logits
(missing)     0.06
Initiative    0.06
капит         0.06
ERENCE        0.06
84            0.06
northeastern  0.06
причина       0.06
Gone          0.06
Blog          0.05
FIX           0.05
Activations Density 0.000%