INDEX
    Explanations

    phrases related to avoiding or preventing something negative or undesirable

    New Auto-Interp
    Negative Logits
    geist
    -0.77
    cart
    -0.71
    toc
    -0.68
    bleacher
    -0.68
    cow
    -0.68
    Must
    -0.67
    lease
    -0.66
    rooms
    -0.65
    ammy
    -0.65
    Led
    -0.65
    POSITIVE LOGITS
     detection
    1.26
     pitfalls
    0.97
     collisions
    0.86
     wasting
    0.86
     relegation
    0.84
     accidents
    0.82
     regress
    0.81
     answering
    0.79
     hazards
    0.79
     fou
    0.79
    Act Density 0.624%

    No Known Activations