    Explanations

    phrases indicating missed opportunities or alternate scenarios

    phrases related to hypothetical situations and outcomes

    Negative Logits
    Token         Logit
    " Deb"        -0.63
    " Brach"      -0.63
    "thus"        -0.61
    "ftime"       -0.59
    "verend"      -0.59
    "Must"        -0.58
    "most"        -0.58
    "currently"   -0.58
    "tainment"    -0.57
    "ennett"      -0.57
    Positive Logits
    Token         Logit
    " spared"     0.84
    " avoided"    0.82
    " prevented"  0.78
    " born"       0.76
    "wolves"      0.75
    " invented"   0.74
    " saved"      0.74
    " sooner"     0.74
    " worse"      0.73
    "hes"         0.73
    Activation Density: 0.095%

    No Known Activations