INDEX
Negative Logits
Token          Logit
Verizon        -0.07
abant          -0.06
rellas         -0.06
Fair           -0.06
焼             -0.06
tears          -0.06
lymp           -0.06
Emoji          -0.06
RuleContext    -0.06
UIScreen       -0.06
Positive Logits
Token          Logit
bondage        0.13
BIND           0.07
competent      0.07
worked         0.07
gamble         0.07
stance         0.07
disable        0.07
benefited      0.06
elegance       0.06
怎么           0.06
Activations Density: 0.001%