INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     اس
    0.90
     أو
    0.87
     Because
    0.83
     Bahkan
    0.81
     Sometimes
    0.80
     Even
    0.79
     dlatego
    0.79
     ک
    0.79
     Muchos
    0.75
     lumea
    0.74
    POSITIVE LOGITS
    ुद्ध
    0.80
    </caption>
    0.79
    xlink
    0.74
    0.73
    mene
    0.71
    FA
    0.71
     وصحبه
    0.71
    मी
    0.70
    ียร์
    0.70
    ust
    0.69
    Act Density 2.709%

    No Known Activations