INDEX
    Explanations

    questions or statements inquiring about the outcome or progress of a situation

    inquiries related to outcomes or consequences

    Negative Logits
    riter           -0.71
    "}],"           -0.68
    archives        -0.65
     WATCHED        -0.63
    ul              -0.63
    UTH             -0.62
    visory          -0.62
    athom           -0.61
    apt             -0.59
    ulo             -0.59
    Positive Logits
     reaction       0.81
     reactions      0.71
     fuss           0.70
     fared          0.69
    okin            0.61
     attrition      0.61
     truce          0.60
     shenanigans    0.59
     soph           0.59
     develops       0.59
    Activation Density: 0.086%

    No Known Activations