INDEX
Explanations
expressions of gratitude and appreciation
references to gratitude and appreciation towards others
New Auto-Interp
Negative Logits
enum
-0.74
defaults
-0.70
ageddon
-0.68
ptive
-0.68
2030
-0.65
Elsewhere
-0.65
modifier
-0.65
Worse
-0.64
ivably
-0.64
dystop
-0.63
POSITIVE LOGITS
kindness
1.28
generosity
1.28
invaluable
1.27
gracious
1.21
generous
1.15
kindly
1.10
generously
1.09
professionalism
1.07
dedication
1.02
thoughtful
1.02
Activations Density 1.341%