INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    Catch
    -0.09
    -0.09
     uniek
    -0.09
    univers
    -0.08
    .Object
    -0.08
    Construction
    -0.08
     сами
    -0.08
     обществ
    -0.08
    конт
    -0.08
    Creator
    -0.08
    POSITIVE LOGITS
     Mathf
    0.09
     TRANS
    0.08
     mush
    0.07
     quieter
    0.07
     Botox
    0.07
     diffic
    0.07
     transt
    0.07
     demanded
    0.07
    Tras
    0.07
    /from
    0.07
    Act Density 0.012%

    No Known Activations