Explanations
positive sentiments or mentions
references to positive outcomes or evaluations
Negative Logits
Token       Logit
appings     -0.90
puter       -0.87
oths        -0.86
ngth        -0.84
ptin        -0.81
atum        -0.81
ometimes    -0.78
alian       -0.78
ĸļ          -0.76
hower       -0.76
Positive Logits
Token          Logit
reinforcement  1.13
feedback       1.01
outcome        0.97
outlook        0.96
vib            0.96
attitude       0.91
appraisal      0.90
affirm         0.86
outcomes       0.84
affirmation    0.84
Activations Density: 0.043%