INDEX
    Explanations

    immediate or initial state of being

    New Auto-Interp
    Negative Logits
     הת
    0.53
     آثار
    0.52
    ຜະລິດຕະພັນ
    0.51
     متعلقه
    0.50
    ޙ
    0.50
     événements
    0.49
    Desen
    0.49
    0.49
     המח
    0.48
    0.48
    POSITIVE LOGITS
    re
    0.61
    ur
    0.51
    il
    0.49
    el
    0.47
     Retail
    0.47
    cgi
    0.46
    am
    0.46
    ad
    0.46
    ul
    0.46
    ou
    0.46
    Act Density 0.001%

    No Known Activations