INDEX
Negative Logits
-esteem
-0.08
BET
-0.07
suffix
-0.06
Lois
-0.06
children
-0.06
smith
-0.06
untreated
-0.06
dialog
-0.06
ladies
-0.06
carpets
-0.06
POSITIVE LOGITS
_REQ
0.07
دهید
0.06
A
0.06
議
0.06
Outlook
0.06
POSITORY
0.06
Reached
0.06
.runtime
0.06
childcare
0.06
몰
0.06
Activations Density 0.016%