INDEX
    Explanations

    references to future plans or intentions

    references to future intentions or strategies

    New Auto-Interp
    Negative Logits
    wcsstore
    -0.77
     Stain
    -0.75
    asions
    -0.68
    ruciating
    -0.67
    selves
    -0.67
    isha
    -0.65
     Jude
    -0.65
    irrel
    -0.64
     Chin
    -0.63
    mint
    -0.61
    POSITIVE LOGITS
     plans
    1.14
     Plans
    1.08
     rollout
    0.85
    paren
    0.80
     reimburse
    0.80
     lawy
    0.76
     plan
    0.75
     ambitions
    0.73
    plan
    0.71
     screenings
    0.71
    Act Density 0.025%

    No Known Activations