INDEX
    Explanations

    phrases indicating presence, assistance, or declaration

    phrases indicating purpose or intention

    Negative Logits
    Token                 Logit
    " reliance"           -0.74
    " attributed"         -0.69
    " resorted"           -0.68
    " reliant"            -0.67
    " fielded"            -0.62
    " ancest"             -0.62
    " synchronization"    -0.62
    " Classification"     -0.60
    " attributable"       -0.60
    " exposures"          -0.59
    Positive Logits
    Token                 Logit
    "brate"               0.92
    " celebrate"          0.83
    " stay"               0.83
    "wark"                0.80
    "orate"               0.79
    "ivo"                 0.77
    "othe"                0.77
    " uphold"             0.76
    "attery"              0.76
    "swer"                0.75
    Activation Density: 0.119%

    No Known Activations