INDEX
Explanations
instances of actions or events happening at a specific time or location
temporal phrases indicating specific moments in narratives
New Auto-Interp
Negative Logits
Grade
-0.63
ãĥ¡
-0.63
hack
-0.63
favorite
-0.61
equal
-0.61
Fine
-0.61
95
-0.59
Justice
-0.59
87
-0.58
trimmed
-0.58
POSITIVE LOGITS
soever
0.94
suddenly
0.84
opus
0.83
*/(
0.82
they
0.81
confronted
0.80
abouts
0.78
she
0.77
he
0.73
tragedy
0.72
Activations Density 0.071%