INDEX
Explanations
references to continuing or persisting actions
phrases indicating continuity or ongoing actions
Negative Logits
Surprise          -0.81
soDeliveryDate    -0.73
Yosemite          -0.72
Mesa              -0.69
Abdel             -0.68
Sierra            -0.68
gems              -0.65
Clothing          -0.65
Bet               -0.63
Fancy             -0.62
Positive Logits
uninterrupted     0.96
adolesc           0.85
abre              0.81
footsteps         0.79
tradition         0.77
downward          0.73
liction           0.73
steady            0.72
unchanged         0.71
trend             0.71
Activations Density: 0.157%