INDEX
Explanations
instances of the word "guess" and variations of questioning or speculation
New Auto-Interp
Negative Logits
avras
-0.15
untu
-0.15
allas
-0.15
yms
-0.14
edBy
-0.14
amespace
-0.14
arcer
-0.14
ide
-0.14
entifier
-0.14
unn
-0.14
POSITIVE LOGITS
Guess
0.26
guesses
0.22
work
0.22
guess
0.21
guessing
0.20
guessed
0.20
Guess
0.19
guess
0.18
(es
0.17
correctly
0.17
Activations Density 0.020%