INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    leitung
    -0.07
     "=",
    -0.07
     миним
    -0.07
     ineff
    -0.07
    -0.07
    етич
    -0.07
    wild
    -0.07
    claimer
    -0.06
    _ghost
    -0.06
    .plus
    -0.06
    POSITIVE LOGITS
     childcare
    0.06
    Sheet
    0.06
    memberOf
    0.06
     لم
    0.06
     percussion
    0.06
     //-
    0.06
    ToAdd
    0.06
    0.06
    `↵↵
    0.06
     благод
    0.06
    Act Density 0.008%

    No Known Activations