INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     fatal
    -0.08
     полностью
    -0.08
    (play
    -0.07
     tu
    -0.07
    -bit
    -0.06
    ique
    -0.06
     primitive
    -0.06
     exhaustion
    -0.06
    َق
    -0.06
     nx
    -0.06
    POSITIVE LOGITS
     numberWith
    0.07
    },{"
    0.06
    liğ
    0.06
     فه
    0.06
    asyarak
    0.06
    "]=$
    0.06
    stdcall
    0.06
    ับน
    0.06
    άνι
    0.06
     oluyor
    0.06
    Act Density 0.017%

    No Known Activations