INDEX
Negative Logits
선정
-0.09
recommandé
-0.08
colonne
-0.08
Reference
-0.08
catal
-0.08
توص
-0.07
curry
-0.07
buah
-0.07
RP
-0.07
રંગ
-0.07
POSITIVE LOGITS
disclosures
0.10
autobi
0.10
dichiar
0.09
Disclosure
0.09
declaration
0.09
voluntarily
0.09
заявил
0.09
Declarations
0.09
autobiography
0.09
confess
0.09
Activations Density 0.332%