INDEX
Explanations
phrases indicating a future time or sequence of events
references to temporal markers or specific times in the narrative
New Auto-Interp
Negative Logits
afety
-0.87
76561
-0.78
Plot
-0.72
ãĥ´
-0.72
Container
-0.67
tnc
-0.66
Sweep
-0.66
minecraft
-0.66
Simpl
-0.66
utra
-0.65
POSITIVE LOGITS
imester
0.75
ABE
0.71
wiser
0.68
isode
0.68
sidx
0.68
headers
0.68
arrive
0.67
recons
0.66
leases
0.64
aptic
0.63
Activations Density 0.140%