    Explanations

    words related to actions and descriptions of characters in a narrative

    Negative Logits
    utin          -0.17
    ensed         -0.16
    ène           -0.16
    USH           -0.16
    ãĥ¬ãĥĥãĥĪ     -0.16
    éli           -0.15
    ocos          -0.15
    -icons        -0.15
    rowable       -0.15
    ipay          -0.15

    Positive Logits
    ä             0.21
    ie            0.20
    246           0.20
    ö             0.20
    ü             0.18
    age           0.18
    âĶ            0.18
    ei            0.18
    au            0.17
    ür            0.17

    Activation Density: 0.123%

    No Known Activations