INDEX
    Explanations

    the presence of the phrase "trying to"

    phrases that convey attempts or intentions

    Negative Logits
    "most"          -0.68
    " Clouds"       -0.63
    " Rising"       -0.62
    " Returning"    -0.61
    " Appears"      -0.60
    " Cros"         -0.59
    " Houses"       -0.59
    "ILY"           -0.57
    " Vol"          -0.56
    " Lag"          -0.56

    Positive Logits
    " recreate"      1.23
    " emulate"       1.21
    " convince"      1.16
    " imitate"       1.15
    " revive"        1.14
    " replicate"     1.12
    " conserve"      1.11
    " reconcile"     1.10
    " establish"     1.06
    " eliminate"     1.06
    Activation Density: 0.075%

    No Known Activations