INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ەن
    0.68
    ങ്ങാ
    0.68
    0.65
    0.64
    0.61
    0.60
    𝕄
    0.59
    पतवार
    0.59
    0.59
     আহসান
    0.58
    POSITIVE LOGITS
     L
    2.23
     Л
    2.22
    2.22
    L
    2.20
    2.08
    2.03
    2.01
    2.01
     Ло
    1.99
     ل
    1.96
    Act Density 2.504%

    No Known Activations