    Negative Logits
     Player  -0.06
     Canton  -0.06
    ------+  -0.06
    uParam  -0.06
     Alter  -0.06
     mural  -0.06
     Bet  -0.06
     anlaş  -0.06
     усп  -0.06
     nests  -0.06
    Positive Logits
    RGB  0.07
    (no token shown)  0.06
    -disable  0.06
    jspb  0.06
    عد  0.06
    ží  0.06
    (no token shown)  0.06
     undeniable  0.06
     شماره  0.06
    Khi  0.06
    Act Density 0.017%

    No Known Activations