INDEX
Explanations
I'm expressing positive feelings or wishes
New Auto-Interp
Negative Logits
Concern
0.50
Maybe
0.46
Concern
0.44
WARNING
0.44
avoidance
0.44
Need
0.43
alertness
0.43
favoring
0.42
timeouts
0.42
Concerned
0.42
POSITIVE LOGITS
eagerly
1.11
eager
0.95
excited
0.84
excitedly
0.82
thrilled
0.81
hope
0.79
gratefully
0.78
excited
0.77
gladly
0.77
eagerness
0.77
Activations Density 0.009%