INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    j
    1.34
    ف
    1.34
    og
    1.28
    f
    1.27
    uiDesigner
    1.22
    أ
    1.22
    ou
    1.20
    1.19
    িন
    1.15
    1.15
    POSITIVE LOGITS
    lardan
    1.30
    larda
    1.18
     시절
    1.05
    다는
    1.02
     had
    0.98
     couldn
    0.98
     didn
    0.97
     вариантов
    0.97
    ruled
    0.96
    গুলো
    0.96
    Act Density 0.088%

    No Known Activations