Explanations
references to quitting or giving up on jobs or activities
Negative Logits
away     -0.16
Away     -0.15
ordeal   -0.15
ION      -0.14
vac      -0.14
752      -0.14
Weiss    -0.14
ael      -0.14
_WRAP    -0.14
procs    -0.14
Positive Logits
altogether   0.18
cold         0.17
mid          0.17
quit         0.16
tright       0.16
smoking      0.16
/start       0.15
Smoking      0.15
олод         0.15
маз          0.15
Activations Density: 0.028%