INDEX
Explanations
expressions of gratitude
expressions of gratitude and appreciation
New Auto-Interp
Negative Logits
soever
-0.74
soDeliveryDate
-0.68
idth
-0.67
MW
-0.67
enium
-0.67
aults
-0.66
MRI
-0.66
arc
-0.66
heid
-0.66
uns
-0.62
POSITIVE LOGITS
helping
1.03
letting
0.96
trusting
0.93
noticing
0.92
supporting
0.90
contacting
0.89
joining
0.88
generously
0.87
sponsoring
0.85
supplying
0.84
Activations Density 0.058%