INDEX
Explanations
phrases related to someone expressing a statement or opinion
the word "that" in various contexts
New Auto-Interp
Negative Logits
EMBER
-0.71
andem
-0.63
IELD
-0.62
tails
-0.62
ãĤµãĥ¼ãĥĨãĤ£ãĥ¯ãĥ³
-0.62
SHA
-0.60
izont
-0.59
gments
-0.59
STE
-0.59
arest
-0.59
POSITIVE LOGITS
although
0.82
sounded
0.75
"[
0.69
contradicts
0.68
'[
0.68
preceded
0.65
whilst
0.63
"#
0.63
they
0.62
amera
0.61
Activations Density 0.217%