INDEX
Explanations
dialogue or quotation marks surrounding speech
Negative Logits
standardized    -0.82
denomin         -0.81
tariff          -0.81
manufacturer    -0.80
tablet          -0.79
licens          -0.78
assigned        -0.78
endors          -0.78
wage            -0.76
administr       -0.75
Positive Logits
Dirty           1.24
Panic           1.22
Goodbye         1.21
Revenge         1.20
Haunted         1.20
Fever           1.20
Sleeping        1.19
Trouble         1.16
Falling         1.15
Salvation       1.15
Activations Density 0.103%