INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    -0.07
     стара
    -0.07
    δη
    -0.07
    .White
    -0.07
    cstdio
    -0.07
    ewidth
    -0.07
     porte
    -0.06
    jb
    -0.06
    -0.06
    รค
    -0.06
    POSITIVE LOGITS
     recom
    0.15
     Bom
    0.07
     wom
    0.07
     recomm
    0.07
     discrim
    0.07
    egrator
    0.07
     acronym
    0.07
     recon
    0.07
     Broadcom
    0.07
    .TypeOf
    0.07
    Act Density 0.005%

    No Known Activations