INDEX
    Explanations

    elements associated with measurement and values in various contexts

    New Auto-Interp
    Negative Logits
    ãĥ«ãĥī
    -0.17
    oub
    -0.16
     doubly
    -0.15
    ipay
    -0.15
    iran
    -0.15
     Aviv
    -0.15
    ongan
    -0.15
    emin
    -0.15
     جÙħعÛĮت
    -0.15
     kne
    -0.14
    POSITIVE LOGITS
    MU
    0.17
     Mun
    0.17
     ÏĥοÏħ
    0.17
     MU
    0.17
    mun
    0.15
     Shake
    0.15
     Pen
    0.15
    ti
    0.14
    Âłm
    0.14
     ÐĿад
    0.14
    Act Density 0.044%

    No Known Activations