INDEX
    Explanations

    phrases related to falling into a certain category or situation

    phrases that indicate the concept of falling into traps or categories

    Negative Logits
    Token        Logit
    "be"         -0.69
    "hunt"       -0.68
    " indu"      -0.67
    " endors"    -0.66
    "toe"        -0.65
    "enery"      -0.65
    " toured"    -0.62
    "iere"       -0.62
    "conn"       -0.61
    "press"      -0.61
    Positive Logits
    Token            Logit
    " obscurity"     0.75
    " whichever"     0.74
    " Disorder"      0.71
    "Seg"            0.70
    " submission"    0.69
    "Role"           0.68
    " limbo"         0.67
    " trap"          0.66
    " position"      0.66
    "olester"        0.66
    Activation Density: 0.075%

    No Known Activations