INDEX
    Explanations

    phrases related to some items being out of a place or context

    instances of the word "out."

    Negative Logits
     arsen       -0.67
     Pry         -0.64
     grooming    -0.63
     refined     -0.63
     bonded      -0.62
     melting     -0.61
     age         -0.61
     trem        -0.61
     Puzzles     -0.58
     irrad       -0.58
    Positive Logits
    lier         1.06
    door         1.04
    doors        1.02
    casts        1.00
    stretched    0.93
    out          0.93
    outs         0.93
    dated        0.92
    lander       0.90
    fitted       0.89
    Activation Density: 0.014%

    No Known Activations