INDEX
Explanations
words related to negative outcomes or consequences
references to loss or its consequences
New Auto-Interp
Negative Logits
ENTS
-0.70
inventive
-0.68
ATT
-0.66
rouse
-0.65
ECK
-0.63
dotted
-0.62
thodox
-0.62
":[{"-0.62
ansky
-0.61
Occupations
-0.61
POSITIVE LOGITS
loss
1.11
Loss
1.06
loss
1.04
aversion
0.97
iem
0.89
losses
0.89
byss
0.82
experien
0.73
landfall
0.73
luster
0.72
Activations Density 0.010%