INDEX
    Explanations

    corrections or revisions in text

    New Auto-Interp
    Negative Logits
    inho
    -0.14
    ls
    -0.13
    tures
    -0.13
     زÙħ
    -0.13
    mine
    -0.13
    idis
    -0.13
     Craig
    -0.13
    alf
    -0.13
     ý
    -0.13
    çIJ´
    -0.13
    POSITIVE LOGITS
    TOTYPE
    0.16
    orrect
    0.16
     correct
    0.16
    šov
    0.16
    ibu
    0.15
     indeb
    0.15
    à¸Ļาม
    0.15
    .om
    0.15
    icontrol
    0.14
    ElementsBy
    0.14
    Act Density 0.014%

    No Known Activations