NEGATIVE LOGITS
Token     Logit
вар       -0.07
sto       -0.07
IID       -0.06
ders      -0.06
oloj      -0.06
هنوز      -0.06
acı       -0.06
)}.       -0.06
durum     -0.06
krb       -0.06
POSITIVE LOGITS
Token       Logit
Towards     0.08
Increased   0.07
education   0.07
ención      0.07
Yorkshire   0.07
октября     0.07
companies   0.07
"They       0.07
repayment   0.07
direction   0.06
Activations Density: 0.000%