INDEX
Explanations
references to positive and negative impacts or outcomes
Negative Logits
ĸļ       -0.88
flies    -0.87
puter    -0.87
under    -0.84
abies    -0.81
pots     -0.78
hid      -0.78
tower    -0.76
tub      -0.76
amura    -0.76
Positive Logits
attitude         0.91
feedback         0.90
outlook          0.89
reinforcement    0.88
effect           0.81
appraisal        0.80
gearing          0.79
outcome          0.79
effects          0.78
ities            0.78
Activations Density: 2.990%