INDEX
    Explanations

    scenarios or situations posed as questions, often beginning with "What if" or "What if we"

    conditional questions or scenarios presented with "what if."

    New Auto-Interp
    Negative Logits
    igmatic
    -0.79
    cedented
    -0.77
    vant
    -0.77
    cised
    -0.73
    abre
    -0.73
    nect
    -0.72
     pione
    -0.71
    enfranch
    -0.69
    ply
    -0.69
    20439
    -0.69
    POSITIVE LOGITS
     someday
    1.06
     somebody
    0.90
     someone
    0.88
    ...?
    0.86
     we
    0.80
     they
    0.79
     there
    0.78
     you
    0.78
     somehow
    0.77
     instead
    0.76
    Act Density 0.063%

    No Known Activations