INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     alternatively
    0.49
     внутріш
    0.45
    гія
    0.45
    єте
    0.45
    ชน์
    0.44
     dvi
    0.42
    Particular
    0.42
    Owing
    0.41
    𝕝
    0.41
     গুরুদেব
    0.41
    POSITIVE LOGITS
     wannan
    0.75
     da
    0.74
     daga
    0.74
     cikin
    0.70
     kamar
    0.68
     kuwa
    0.67
     suna
    0.67
     ya
    0.65
     kuma
    0.65
     ƙ
    0.64
    Act Density 0.000%

    No Known Activations