INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    entu
    -0.08
     Qa
    -0.08
     grayscale
    -0.08
     σου
    -0.07
    .<
    -0.07
     Ua
    -0.07
     impos
    -0.07
    。当然
    -0.07
    <ul
    -0.07
    <img
    -0.07
    POSITIVE LOGITS
     colleagues
    0.08
     ko
    0.08
    _Num
    0.08
     collègues
    0.08
    illeri
    0.08
     Kollegen
    0.08
     coleg
    0.08
     hospitalized
    0.07
    ానం
    0.07
    shme
    0.07
    Act Density 0.022%

    No Known Activations