INDEX
    Explanations
    Negative Logits
    ignored          -0.06
    agh              -0.06
    enance           -0.06
    ủa               -0.06
                     -0.06
                     -0.06
    oruč             -0.06
    ょう             -0.06
    يث               -0.06
     courteous       -0.06
    Positive Logits
                     0.06
    医院             0.06
     municip         0.06
     IPS             0.06
    leground         0.06
    ้ย               0.06
     Daughter        0.06
    (plan            0.06
     goof            0.06
     socioeconomic   0.06
    Activation Density: 0.099%

    No Known Activations