INDEX
    Explanations

    instances of the word "wonder" and its derivatives, indicating a focus on curiosity and contemplation

    Negative Logits
    DataManager    -0.15
    ssue           -0.14
    edb            -0.14
    γγ             -0.14
    redo           -0.14
    绾             -0.13
    ces            -0.13
    she            -0.13
    wy             -0.13
    ulated         -0.13
    Positive Logits
    atoria         0.21
    раст           0.16
    ous            0.15
    ocks           0.14
    ocker          0.14
    ala            0.14
    haf            0.14
    oad            0.14
    anka           0.14
    ziej           0.14
    Act Density 0.011%

    No Known Activations