INDEX
    Explanations
    NEGATIVE LOGITS
     sprites    -0.10
    entral      -0.09
    prite       -0.09
    upon        -0.08
    Sprite      -0.08
                -0.08
    لية         -0.08
    Sprites     -0.08
    :+          -0.08
    ขึ้น        -0.08
    POSITIVE LOGITS
    .Password   0.09
     Writer     0.08
     LS         0.08
     toep       0.08
     گذ         0.08
     Calls      0.08
     Fro        0.08
     FV         0.08
     Eisen      0.08
     FH         0.08
    Activation Density 0.002%

    No Known Activations