INDEX
    Explanations
    NEGATIVE LOGITS
    " betweenstory"  -0.69
    "inheit"         -0.66
    "таратура"       -0.56
    "POSURE"         -0.51
    "chequer"        -0.49
    "ushy"           -0.48
    " desto"         -0.47
    "Cormack"        -0.47
    "})]"            -0.47
    "pahan"          -0.46
    POSITIVE LOGITS
    " birds"   1.07
    " bird"    1.05
    " Bird"    1.01
    " Birds"   1.01
    "Birds"    1.01
    " BIRD"    0.99
    " ornith"  0.97
    "Bird"     0.94
    "bird"     0.85
    " BIRDS"   0.85
    Activation Density: 0.133%

    No Known Activations