INDEX
    Explanations

    references to historical context or events

    New Auto-Interp
    Negative Logits
    umd
    -0.15
    vig
    -0.15
    pn
    -0.14
    elm
    -0.13
    mw
    -0.13
    nout
    -0.13
    gid
    -0.13
    yll
    -0.13
    uy
    -0.13
    no
    -0.13
    POSITIVE LOGITS
     history
    0.22
    /history
    0.19
    -history
    0.18
    history
    0.17
     History
    0.17
    ãĥĥãĤ·ãĥ¥
    0.17
    smarty
    0.16
    -addon
    0.15
     иÑģÑĤоÑĢии
    0.15
    à¹ģà¸ŀ
    0.15
    Act Density 0.090%

    No Known Activations