INDEX
Explanations
words related to leaving a job or situation
New Auto-Interp
Negative Logits
Catalog
-0.59
inen
-0.58
ANS
-0.56
---------------
-0.55
arov
-0.53
Herm
-0.51
arthy
-0.51
operated
-0.51
Grant
-0.50
HOU
-0.50
POSITIVE LOGITS
smoking
0.88
ters
0.81
ting
0.75
Smoking
0.72
Quit
0.67
quitting
0.64
eating
0.57
whining
0.56
smoking
0.55
quit
0.53
Activations Density 4.139%