INDEX
Explanations
phrases indicating a negative consequence or impact
phrases indicating abandonment or loss
New Auto-Interp
Negative Logits
wcsstore
-0.78
buster
-0.67
Refer
-0.62
wana
-0.62
intend
-0.61
Refer
-0.60
percent
-0.60
relayed
-0.59
breaker
-0.59
inance
-0.59
POSITIVE LOGITS
undone
1.03
scars
0.95
overs
0.93
unanswered
0.87
behind
0.85
unexpl
0.82
footprints
0.79
untold
0.79
unexplained
0.79
him
0.77
Activations Density 0.035%