INDEX
Explanations
time-related expressions or uncertainties
occurrences of the word "when."
Negative Logits
agin     -0.75
gan      -0.71
rolet    -0.66
gur      -0.66
zzi      -0.65
idan     -0.64
athom    -0.61
yre      -0.60
aine     -0.60
yi       -0.60
Positive Logits
soever       1.36
irlf         0.96
confronted   0.81
abouts       0.77
faced        0.76
pressed      0.71
comparing    0.71
asked        0.68
theless      0.68
IPS          0.66
Activations Density: 0.133%