INDEX
    Explanations

    mentions of the word "ay" and its variations

    New Auto-Interp
    Negative Logits
    sy
    -0.21
    د
    -0.20
    su
    -0.19
    s
    -0.19
    erer
    -0.19
    erator
    -0.18
    sell
    -0.17
    sch
    -0.17
    er
    -0.16
    sen
    -0.16
    POSITIVE LOGITS
    urved
    0.23
    enne
    0.23
    yyyy
    0.23
    yyy
    0.21
    eur
    0.20
    den
    0.20
    YYYY
    0.20
    ton
    0.20
    eb
    0.20
    oncé
    0.20
    Act Density 0.098%

    No Known Activations