    Negative Logits
    Token            Logit
    yards            -0.07
    laden            -0.07
    allel            -0.06
     deployment      -0.06
     confl           -0.06
    ador             -0.06
    odal             -0.06
    \Html            -0.06
    otechnology      -0.06
    [to              -0.06
    Positive Logits
    Token            Logit
    จร               0.06
    DOCTYPE          0.06
    /goto            0.06
    (blank)          0.06
    (blank)          0.06
     athletics       0.06
     Feather         0.06
     Leave           0.06
    addAction        0.06
     αυτή            0.06
    Activation Density: 0.048%

    No Known Activations