INDEX
Explanations
expressions of gratitude
expressions of gratitude or appreciation
Negative Logits
CHR      -0.71
Gi       -0.70
uns      -0.69
alties   -0.69
HS       -0.68
alde     -0.68
arc      -0.67
paio     -0.66
ustain   -0.65
IRE      -0.64
Positive Logits
letting      1.28
helping      1.25
stopping     1.22
noticing     1.18
contacting   1.17
allowing     1.16
agreeing     1.16
reminding    1.14
sticking     1.12
joining      1.11
Activations Density 0.059%