INDEX
Explanations
phrases indicating a comparison or decision-making moment
conditional phrases or contexts that start with "when."
New Auto-Interp
Negative Logits
vantage
-0.83
voice
-0.74
rieve
-0.73
tyard
-0.72
ve
-0.70
gan
-0.70
entry
-0.70
agin
-0.68
ilon
-0.68
cu
-0.68
POSITIVE LOGITS
soever
0.78
contrasted
0.76
compared
0.73
they
0.72
thou
0.67
expecting
0.66
polls
0.64
THEY
0.64
=~
0.64
simultaneously
0.64
Activations Density 0.170%