INDEX
    Explanations

    occurrences of the word "przez"

    Negative Logits
    ajar          -0.16
    aje           -0.15
    omm           -0.15
    ulum          -0.15
    ande          -0.14
    ahat          -0.14
    ouve          -0.14
    fers          -0.14
    cheid         -0.14
    Convention    -0.14
    Positive Logits
    ιÏİν          0.15
    otron         0.14
     orth         0.14
    orque         0.14
    -gnu          0.14
    daq           0.13
    urally        0.13
    ÛĮاÙĨ         0.13
    sse           0.13
    ersist        0.13
    Act Density 0.001%

    No Known Activations