INDEX
Explanations
instances of the word "when" in statements indicating uncertainty or lack of knowledge
occurrences of the word "when."
New Auto-Interp
Negative Logits
agin
-0.66
zzi
-0.65
kaya
-0.61
rolet
-0.61
gur
-0.59
bear
-0.59
actor
-0.59
endish
-0.59
elman
-0.59
edly
-0.59
POSITIVE LOGITS
soever
1.31
irlf
0.94
abouts
0.82
confronted
0.79
faced
0.73
IPS
0.72
theless
0.70
pressed
0.66
asked
0.66
transitioning
0.65
Activations Density 0.121%