INDEX
Explanations
positive words and phrases, possibly related to favorable outcomes or aspects
New Auto-Interp
Negative Logits
puter
-0.90
ĸļ
-0.87
hid
-0.87
ometimes
-0.84
plain
-0.81
ptin
-0.80
appings
-0.79
pread
-0.78
alian
-0.78
alus
-0.77
POSITIVE LOGITS
reinforcement
1.09
outlook
1.06
feedback
1.06
attitude
1.02
vib
0.99
outcome
0.95
appraisal
0.94
affirmation
0.89
affirm
0.87
gearing
0.86
Activations Density 0.809%