    Explanations

    references to "world" in various contexts, indicating a focus on global or universal themes

    Negative Logits
    elor      -0.18
    eters     -0.16
    ases      -0.16
    imson     -0.16
    ábado     -0.14
    atures    -0.14
    imap      -0.14
    è¼Ŀ       -0.14
    mons      -0.14
    oter      -0.13
    Positive Logits
    -wide     0.30
    liness    0.26
    Wide      0.25
    wide      0.24
     wide     0.23
    views     0.23
     Wide     0.22
    -ren      0.19
    /world    0.18
    Ú¯ÛĮر      0.17
    Activation Density: 0.094%

    No Known Activations