INDEX
    Explanations

    the word "do" in various forms and contexts

    New Auto-Interp
    Negative Logits
     Efq
    -1.19
     nahilalakip
    -1.16
     &___
    -1.06
    WriteBarrier
    -1.02
     pleaſure
    -0.99
    ſelf
    -0.97
    CloseOperation
    -0.96
     مرئيه
    -0.94
    GEBURTSDATUM
    -0.93
     مشين
    -0.93
    POSITIVE LOGITS
    0.63
     I
    0.62
     done
    0.62
     form
    0.57
     In
    0.54
     is
    0.53
     in
    0.52
    ,
    0.52
     `
    0.52
     so
    0.51
    Act Density 0.129%

    No Known Activations