    NEGATIVE LOGITS
    Token         Logit
    " nothing"    -0.07
    " fell"       -0.06
    " climate"    -0.06
    " port"       -0.06
    "šit"         -0.06
    " advert"     -0.06
    (blank)       -0.06
    "іту"         -0.06
    " remot"      -0.06
    "-wing"       -0.06
    POSITIVE LOGITS
    Token         Logit
    " Shuffle"    0.09
    " shuffle"    0.08
    ".shuffle"    0.08
    "وفي"         0.08
    ".scrollTo"   0.07
    " toplam"     0.07
    "Crud"        0.07
    "ของร"        0.07
    " tangled"    0.07
    " επί"        0.07
    Activation Density: 0.002%

    No Known Activations