INDEX
Explanations
requests for comments or responses that were not immediately provided or responded to
instances of non-responsiveness in communication
New Auto-Interp
Negative Logits
Treat
-0.70
yton
-0.66
ptin
-0.58
margins
-0.58
assemb
-0.58
lower
-0.57
gran
-0.57
Galile
-0.56
Less
-0.55
joice
-0.55
POSITIVE LOGITS
acknow
0.94
nor
0.89
satisf
0.87
confir
0.84
emailed
0.77
amera
0.72
answering
0.72
substant
0.70
condolences
0.69
reply
0.68
Activations Density 0.130%