INDEX
Explanations
actions involving providing assistance or services
phrases that indicate willingness to provide assistance or services
Negative Logits
Bey               -0.78
assault           -0.73
NAS               -0.69
ieties            -0.69
Bundy             -0.68
enes              -0.67
Amid              -0.67
SpaceEngineers    -0.67
uve               -0.66
berman            -0.65
Positive Logits
refund            1.12
kindly            1.07
recommend         1.07
notify            1.01
gladly            1.00
advise            0.96
reimburse         0.94
reimb             0.94
publish           0.92
anonym            0.92
Activations Density 0.492%
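
The density figure above can be read as the share of tokens on which the feature fires. The sketch below assumes that reading (activation strictly above a threshold of zero); the function name `activation_density` and the sample activations are hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens on which the feature
    is active; the dashboard's exact definition is not stated in the source.
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical data: 1000 tokens, with the feature active on 5 of them.
acts = [0.0] * 995 + [0.3, 1.2, 0.8, 2.1, 0.5]
density = activation_density(acts)
print(f"{density:.3%}")
```

Under this reading, a density of 0.492% corresponds to the feature firing on roughly one token in two hundred.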