INDEX
Explanations
instances of events or actions happening at a specific point in time
occurrences of the word "when" to indicate time references
New Auto-Interp
Negative Logits
ouble
-0.84
ictive
-0.80
whatever
-0.74
iosyncr
-0.73
gem
-0.72
uce
-0.69
omal
-0.68
arling
-0.66
kaya
-0.65
educ
-0.64
POSITIVE LOGITS
soever
1.12
he
1.04
they
1.03
asked
1.02
confronted
0.99
contacted
0.93
she
0.91
faced
0.85
interviewed
0.82
approached
0.81
Activations Density 0.103%