INDEX
    Explanations

    foreign language

    New Auto-Interp
    Negative Logits
    xp
    -0.09
     Edel
    -0.08
     Bones
    -0.08
    ּ
    -0.08
    cheap
    -0.08
     Rh
    -0.08
    -0.08
    -0.07
    npc
    -0.07
    ROOT
    -0.07
    POSITIVE LOGITS
     тем
    0.09
     تو
    0.09
     ق
    0.08
     versi
    0.08
     تط
    0.08
    เพ
    0.08
    0.08
    0.08
     fundamentally
    0.07
     contribu
    0.07
    Act Density 0.091%

    No Known Activations