INDEX
Explanations
positive descriptors related to performance or quality
words related to positive performance or quality assessments
New Auto-Interp
Negative Logits
hyde
-0.71
billion
-0.70
soDeliveryDate
-0.70
abel
-0.67
mega
-0.66
uthor
-0.65
gren
-0.64
ften
-0.63
Aaron
-0.63
":[
-0.63
POSITIVE LOGITS
results
1.22
odds
1.08
grades
1.06
reviews
1.06
visibility
1.04
intentions
1.03
outcomes
1.02
margins
1.02
luck
0.98
ratings
0.98
Activations Density 0.128%