INDEX
    Explanations
    NEGATIVE LOGITS
     Zack           -0.07
     pornofil       -0.06
     uzak           -0.06
    -Key            -0.06
    .size           -0.06
    okia            -0.06
                    -0.06
    "id             -0.06
     Bread          -0.06
    Mvc             -0.06
    POSITIVE LOGITS
     convenient     0.08
    869             0.07
     billions       0.07
    ーボ              0.07
    atement         0.07
     ------------------------------------------------------------  0.07
    -active         0.06
     blocks         0.06
    Monthly         0.06
    -minus          0.06
    Act Density 0.002%

    No Known Activations