INDEX
    Explanations

    phrases related to leaving or escaping

    phrases indicating a desire or need to escape or leave a situation

    New Auto-Interp
    Negative Logits
     heartbeat
    -0.70
     Emin
    -0.62
     Brach
    -0.59
     Hera
    -0.58
    guyen
    -0.54
     gallery
    -0.52
     Huang
    -0.51
     slideshow
    -0.51
     hemisphere
    -0.50
     waning
    -0.49
    POSITIVE LOGITS
    ta
    1.15
     alive
    0.84
    doors
    0.83
    bid
    0.80
    fitted
    0.78
    done
    0.78
    smart
    0.77
    stretched
    0.76
    played
    0.74
    last
    0.72
    Act Density 0.057%

    No Known Activations