INDEX
    Explanations

    phrases indicating the outcome or consequence of actions

    New Auto-Interp
    Negative Logits
    zos
    -0.16
    defs
    -0.16
     Flake
    -0.16
    actionDate
    -0.15
    aterno
    -0.15
    PointerException
    -0.15
    hek
    -0.14
    -fw
    -0.14
    à¸Ħว
    -0.14
    agog
    -0.14
    POSITIVE LOGITS
     we
    0.24
     tenemos
    0.23
     have
    0.19
    _HAVE
    0.18
    we
    0.18
    have
    0.17
    å¾Ĺåΰ
    0.17
    æľī
    0.17
    .we
    0.16
    æĪij们
    0.16
    Act Density 0.154%

    No Known Activations