INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    书记
    -0.07
    classifier
    -0.06
     mainWindow
    -0.06
    variant
    -0.06
    ीदव
    -0.06
    _penalty
    -0.06
    ']=$
    -0.06
    ivent
    -0.06
    staking
    -0.06
     //////
    -0.06
    POSITIVE LOGITS
    Thank
    0.07
     upsetting
    0.07
    Article
    0.07
    ظة
    0.06
    _HEX
    0.06
     zcela
    0.06
     правила
    0.06
     Comment
    0.06
    .cap
    0.06
    Found
    0.06
    Act Density 0.002%

    No Known Activations