INDEX
Explanations
the word "guess" followed by a statement/question
instances of the word "guess" and related phrases indicating uncertainty or making inferences
New Auto-Interp
Negative Logits
interrupted
-0.85
iences
-0.77
contained
-0.69
elight
-0.69
iterranean
-0.67
ĸļ
-0.66
tha
-0.65
obe
-0.65
ksh
-0.65
helle
-0.64
POSITIVE LOGITS
guesses
1.14
guess
0.93
guessed
0.80
guessing
0.80
Guess
0.75
lessly
0.72
itial
0.70
IELD
0.67
lege
0.67
incorrectly
0.66
Activations Density 0.026%