INDEX
    Explanations

    pas followed by os or French negation

    New Auto-Interp
    Negative Logits
    eh
    -0.10
    uff
    -0.10
    tras
    -0.10
    pine
    -0.10
    hem
    -0.10
    eted
    -0.09
    helm
    -0.09
    ÂŃi
    -0.09
    tring
    -0.09
    pector
    -0.09
    POSITIVE LOGITS
    sthrough
    0.19
    adena
    0.15
    ible
    0.13
    ively
    0.12
    ionate
    0.12
    sth
    0.12
    SED
    0.12
    engers
    0.11
    enger
    0.11
    ION
    0.11
    Act Density 0.015%

    No Known Activations