INDEX
Explanations
phrases related to prevention or hindrance
phrases indicating prevention or obstacles
Negative Logits
bush              -0.74
atl               -0.72
soDeliveryDate    -0.71
nob               -0.69
aic               -0.69
tops              -0.69
oct               -0.68
KC                -0.67
MON               -0.67
uid               -0.67
Positive Logits
accessing         1.03
reaching          0.91
completing        0.90
obtaining         0.89
achieving         0.87
entering          0.83
participating     0.83
getting           0.82
being             0.80
taking            0.80
Activations Density 0.054%