INDEX
Explanations
events or actions happening in a sequence
instances of the word "when" indicating temporal sequences or events
New Auto-Interp
Negative Logits
zag
-0.82
Í
-0.75
ouble
-0.72
species
-0.71
cheat
-0.71
JV
-0.71
whatever
-0.70
igmatic
-0.70
abal
-0.69
tick
-0.68
POSITIVE LOGITS
asked
1.49
confronted
1.42
questioned
1.34
contacted
1.33
pressed
1.28
approached
1.15
quizz
1.11
soever
1.04
interviewed
1.03
challenged
1.03
Activations Density 0.108%