INDEX
    Explanations

    instances of the word "even," suggesting a focus on emphasizing unexpected or contrasting situations

    Negative Logits
     именно
    -0.17
     also
    -0.17
     Guth
    -0.16
     superf
    -0.15
     only
    -0.14
    alth
    -0.14
     not
    -0.14
    Declare
    -0.14
    повід
    -0.14
    te
    -0.14
    Positive Logits
    omid
    0.17
    bedo
    0.15
     necessarily
    0.15
    mium
    0.14
    hint
    0.14
     remot
    0.14
    677
    0.14
    iese
    0.14
    kowski
    0.14
     بوابة
    0.14
    Act Density 0.043%

    No Known Activations