INDEX
Explanations
scenarios or situations posed as questions, often beginning with "What if" or "What if we"
conditional questions or scenarios presented with "what if."
Negative Logits
igmatic     -0.79
cedented    -0.77
vant        -0.77
cised       -0.73
abre        -0.73
nect        -0.72
pione       -0.71
enfranch    -0.69
ply         -0.69
20439       -0.69
POSITIVE LOGITS
someday     1.06
somebody    0.90
someone     0.88
...?        0.86
we          0.80
they        0.79
there       0.78
you         0.78
somehow     0.77
instead     0.76
Activations Density: 0.063%