INDEX
Negative Logits
merusak
-0.76
Freddie
-0.71
umerate
-0.70
卡
-0.69
cards
-0.69
ogeneity
-0.68
AMIENTO
-0.66
מדי
-0.66
curl
-0.65
性和
-0.65
POSITIVE LOGITS
cents
2.53
penny
2.28
pennies
1.81
pence
1.73
penny
1.66
cent
1.60
Penny
1.48
Penny
1.46
cents
1.45
Cents
1.41
Activations Density 0.026%