INDEX
    Explanations
    Negative Logits
    -0.08
     peculiar  -0.08
     weird  -0.07
     Plan  -0.07
     Primer  -0.07
    cart  -0.07
     Organiz  -0.07
     Plateau  -0.07
    TC  -0.07
     Turn  -0.07
    Positive Logits
     duk  0.09
     boasts  0.09
     предлагает  0.09
    жээ  0.08
    Upgradeable  0.08
     woo  0.08
     sun  0.08
    াধিক  0.08
     концерт  0.08
     востреб  0.08
    Act Density 0.006%

    No Known Activations