INDEX
    Explanations

    unusual events

    NEGATIVE LOGITS
    -0.07  ولد
    -0.06  bg
    -0.06  ABS
    -0.06  θι
    -0.06   türü
    -0.06  екси
    -0.06   засобів
    -0.06
    -0.06  иц
    -0.06
    POSITIVE LOGITS
    0.07  NSNotificationCenter
    0.06   uncont
    0.06  (ins
    0.06  知道
    0.06
    0.06  structions
    0.06   نگ
    0.06   inferior
    0.06   similar
    0.06  anism
    Activation Density: 0.001%

    No Known Activations