INDEX
    Explanations

    phrases related to discarding or removing items

    New Auto-Interp
    Negative Logits
     فريبيس
    -0.51
    Personendaten
    -0.49
     kasarigan
    -0.44
    fjspx
    -0.42
     jsPsych
    -0.42
    ppuden
    -0.41
    TintMode
    -0.40
     ComVisible
    -0.38
    !*\
    -0.37
    StoryboardSegue
    -0.36
    POSITIVE LOGITS
     discarding
    0.67
     discard
    0.67
     discarded
    0.66
     reject
    0.64
    Throwaway
    0.63
    throwaway
    0.61
     rejects
    0.59
     Discard
    0.58
     rejected
    0.57
    throw
    0.57
    Act Density 0.003%

    No Known Activations