INDEX
    Explanations
    Negative Logits
    Token           Logit
    " Armstrong"    -0.09
    "‌ی"             -0.08
    " forwarding"   -0.08
    " samtidig"     -0.08
    "ĵ"             -0.08
    ".Col"          -0.08
    " nud"          -0.08
    " Zimbabwe"     -0.07
    " Tek"          -0.07
    "ording"        -0.07
    Positive Logits
    Token           Logit
    " san"          0.08
    " SO"           0.08
    " ac"           0.07
    "යි"            0.07
    " sparkling"    0.07
    " तरह"          0.07
    " Luxury"       0.07
    "842"           0.07
    "obin"          0.07
    " inil"         0.07
    Activation Density: 0.001%

    No Known Activations