INDEX
    Explanations

    instances of the word "somehow" and expressions of confusion or unexpected outcomes

    New Auto-Interp
    Negative Logits
    endale
    -0.17
    ãģĹãĤĩ
    -0.16
    isoft
    -0.16
    uely
    -0.15
    ernel
    -0.15
    utzer
    -0.15
    asto
    -0.14
     Armour
    -0.14
     Interracial
    -0.14
     Ross
    -0.14
    POSITIVE LOGITS
     somehow
    0.32
     somew
    0.32
     Somehow
    0.23
     magically
    0.18
     manages
    0.15
    LLL
    0.15
     managing
    0.15
     somewhere
    0.15
    qi
    0.14
     mirac
    0.14
    Act Density 0.028%

    No Known Activations