INDEX
Explanations
the word "guess" followed by other words or phrases
instances of the expression "guess what" or similar phrases indicating anticipation or rhetorical questions
New Auto-Interp
Negative Logits
iences
-0.84
interrupted
-0.75
contained
-0.74
ĸļ
-0.70
Interstitial
-0.66
enture
-0.65
acca
-0.64
andals
-0.63
agos
-0.63
EVA
-0.62
POSITIVE LOGITS
guesses
1.09
guess
0.99
guessing
0.88
Guess
0.79
itial
0.77
guessed
0.77
incorrectly
0.71
work
0.70
darn
0.69
yp
0.69
Activations Density 0.043%