INDEX
Explanations
words related to hypothetical scenarios or future possibilities
conditional phrases involving hypothetical scenarios
New Auto-Interp
Negative Logits
IPM
-0.74
Dy
-0.70
laced
-0.68
lines
-0.65
Lawrence
-0.64
down
-0.64
Dixon
-0.64
Trop
-0.63
Sandy
-0.63
Downs
-0.63
POSITIVE LOGITS
be
1.55
have
1.08
issue
1.06
been
1.06
bes
1.00
BE
0.97
minded
0.95
Be
0.95
bel
0.93
they
0.93
Activations Density 0.038%