INDEX
Explanations
expressions of gratitude and appreciation
expressions of gratitude and acknowledgment of support
Negative Logits
versus       -0.71
illac        -0.67
Guard        -0.67
disadvant    -0.65
ificial      -0.61
downgrade    -0.61
raped        -0.61
shun         -0.59
peril        -0.56
éĸ           -0.56
Positive Logits
YOU           0.89
feedback      0.86
volunteers    0.86
contributors  0.83
backers       0.82
applicants    0.79
attendees     0.79
submissions   0.76
sincere       0.75
Feedback      0.74
Activations Density 0.290%