INDEX
Explanations
words related to negative consequences or harm, particularly focused on losses of various kinds
references to various types of loss, particularly in contexts related to emotional, environmental, or economic aspects
New Auto-Interp
Negative Logits
tee
-0.67
Aires
-0.67
ECK
-0.66
tranquil
-0.64
enegger
-0.63
Å¡
-0.60
ENTS
-0.58
Beans
-0.58
imen
-0.58
abad
-0.58
POSITIVE LOGITS
luster
1.11
iem
1.10
aversion
1.05
incurred
1.00
lust
0.89
suffered
0.87
inflicted
0.82
esville
0.82
byss
0.81
prevention
0.81
Activations Density 0.049%