Explanations
expressions of gratitude and appreciation
Negative Logits
thank      -0.29
Thank      -0.27
thanking   -0.26
thanked    -0.24
Thank      -0.23
thank      -0.23
Thanks     -0.22
thanks     -0.22
THANK      -0.22
Thanks     -0.19
Positive Logits
appreciated    0.32
appreciate     0.31
Apprec         0.28
appreciation   0.25
apprec         0.19
ToOne          0.16
uger           0.16
áp             0.15
olist          0.15
OMUX           0.14
Activations Density 0.063%