INDEX
    Explanations

    instances of the word "hum" and its variations, indicating a focus on humility or humility-related concepts

    Negative Logits
    Token     Logit
    yonel     -0.19
    hell      -0.17
    ncia      -0.17
    upo       -0.17
    anje      -0.16
    ень       -0.15
    adem      -0.15
    legate    -0.15
    oler      -0.15
    χή        -0.15
    Positive Logits
    Token     Logit
    pty       0.31
    ankind    0.28
    bug       0.27
    iliated   0.27
    iliate    0.26
    mock      0.25
    mus       0.25
    mers      0.24
    mer       0.23
    bling     0.23
    Act Density 0.008%

    No Known Activations