INDEX
Explanations
time-related events or transitions in a narrative
Negative Logits
aez          -0.73
orde         -0.67
onomy        -0.65
scription    -0.63
orum         -0.62
TY           -0.62
cos          -0.62
holder       -0.60
md           -0.60
own          -0.60
Positive Logits
proceeded    1.12
proceed      0.86
promptly     0.82
succumb      0.81
withdrew     0.80
succumbed    0.76
switched     0.74
recons       0.73
retracted    0.73
reverted     0.72
Activations Density 0.054%