INDEX
Explanations
phrases related to leaving or not leaving something
phrases that convey absence or leaving something behind
Negative Logits
Token        Logit
ML           -0.82
Cosponsors   -0.81
GW           -0.72
NESS         -0.67
Stats        -0.64
GW           -0.64
adj          -0.63
rix          -0.62
ML           -0.62
MN           -0.62
Positive Logits
Token        Logit
intact       1.10
untouched    1.09
unanswered   1.06
undone       1.02
unsatisf     0.97
unfinished   0.95
footprints   0.94
unatt        0.92
unexpl       0.90
residue      0.89
Activations Density: 0.185%