Explanations
noun phrases related to hypothetical scenarios or outcomes
phrases and terms related to future possibilities and outcomes
Negative Logits
resent       -0.66
uclear       -0.64
Vil          -0.62
ython        -0.59
nowhere      -0.57
wonder       -0.56
Votes        -0.56
kept         -0.56
misplaced    -0.56
Scot         -0.54
Positive Logits
entails      0.78
entail       0.76
.</          0.74
.?           0.74
!?           0.72
.--          0.69
.<           0.67
.—           0.66
.;           0.66
hetti        0.65
Activations Density 0.295%