INDEX
Explanations
instances of events happening at specific locations and times
the word "when" repeatedly signaling temporal context in narratives
New Auto-Interp
Negative Logits
agin
-0.73
ictive
-0.73
aches
-0.65
thal
-0.65
aido
-0.64
harm
-0.63
opt
-0.63
rolet
-0.63
bear
-0.62
-0.61
POSITIVE LOGITS
soever
1.15
*/(
0.78
irlf
0.77
confronted
0.76
pressed
0.75
asked
0.74
wcsstore
0.70
EStream
0.70
they
0.70
faced
0.68
Activations Density 0.122%