    Explanations

    phrases related to bouncing back or returning to a previous state or position

    Negative Logits
    Token                 Logit
    " caut"               -0.63
    "ision"               -0.62
    " Newly"              -0.58
    " NEWS"               -0.58
    " notoriously"        -0.57
    " cowork"             -0.57
    "orst"                -0.56
    " understatement"     -0.56
    " inexper"            -0.56
    "weather"             -0.56
    Positive Logits
    Token                 Logit
    "fires"               1.09
    "fired"               1.02
    "packs"               1.00
    "dated"               0.98
    "doors"               0.91
    "tracking"            0.90
    "home"                0.82
    "spin"                0.81
    " home"               0.80
    "wards"               0.80
    Activation Density: 0.035%

    No Known Activations