INDEX
    Explanations

    commentary or annotations in the text

    New Auto-Interp
    Negative Logits
    ----------------------------------------------------------------
    -0.17
    --------------------------------------------------------------------------------
    -0.16
    uchos
    -0.16
    opoulos
    -0.16
    ================================================================
    -0.16
    inel
    -0.15
    ------------------------------------------------
    -0.15
    recogn
    -0.14
    ugins
    -0.14
    sg
    -0.14
    POSITIVE LOGITS
    s
    0.21
    TODO
    0.18
    ÎŃÏģγ
    0.14
    ÐĴÐŀ
    0.14
    ÏĤ
    0.14
    å»Ĭ
    0.14
    ipa
    0.13
     Levy
    0.13
    sian
    0.13
    zá
    0.13
    Act Density 0.099%

    No Known Activations