INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Border
    -0.06
    doc
    -0.06
    Border
    -0.06
    inish
    -0.06
    ANEL
    -0.06
    974
    -0.06
     rooft
    -0.06
     nit
    -0.06
    Doc
    -0.06
    _drive
    -0.06
    POSITIVE LOGITS
    -mail
    0.07
    —we
    0.07
     alıyor
    0.06
    _CAN
    0.06
     bourgeoisie
    0.06
    .Register
    0.06
    emploi
    0.06
     billeder
    0.06
     السعودية
    0.06
    venture
    0.06
    Act Density 0.004%

    No Known Activations