INDEX
Explanations
phrases that indicate a sequence of events or actions
the word "follows" in the context of narrative development
New Auto-Interp
Negative Logits
ldom
-0.71
pite
-0.71
idad
-0.69
zan
-0.69
orc
-0.67
tu
-0.66
vere
-0.66
olla
-0.65
nec
-0.65
oll
-0.65
POSITIVE LOGITS
ĸļ
1.05
suit
0.91
closely
0.79
faithfully
0.74
follows
0.72
bourg
0.71
)=(
0.70
noon
0.67
suit
0.67
=>
0.67
Activations Density 0.026%