INDEX
Explanations
occurrences of the word "when."
references to time-related context, particularly the usage of the word "when"
New Auto-Interp
Negative Logits
agin
-0.75
ictive
-0.74
aptic
-0.68
egu
-0.68
bear
-0.66
educ
-0.64
Die
-0.64
ateral
-0.62
ply
-0.61
icker
-0.61
POSITIVE LOGITS
soever
0.90
fy
0.70
asked
0.70
transitioning
0.69
Prohibition
0.69
contacted
0.68
confronted
0.68
ornia
0.68
designing
0.68
viewed
0.68
Activations Density 0.106%