INDEX
Explanations
actions or statements related to abandoning a course of action or belief
instances of the word "abandon" in various contexts
Negative Logits
gio      -0.66
annis    -0.65
inals    -0.65
uid      -0.61
inn      -0.60
gon      -0.59
iov      -0.59
eway     -0.57
eyed     -0.57
deals    -0.57
POSITIVE LOGITS
abandonment    0.91
abandon        0.86
plin           0.79
ãĥĺ            0.77
unres          0.76
elsen          0.73
Abandon        0.72
disbelief      0.71
ditch          0.71
uncond         0.70
Activations Density 0.016%