INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    imestep
    -0.07
     системы
    -0.06
    ени
    -0.06
    redit
    -0.06
     factory
    -0.06
     glorious
    -0.06
    fontsize
    -0.06
     yr
    -0.06
     timeout
    -0.06
     ورزشی
    -0.06
    POSITIVE LOGITS
     avoid
    0.08
     avoided
    0.08
    0.08
     avoiding
    0.08
    .autoconfigure
    0.07
     avoids
    0.07
     prevent
    0.07
    0.06
     Avoid
    0.06
     particularly
    0.06
    Act Density 0.011%

    No Known Activations