INDEX
    Explanations

    instances of the word "Say" and variations of it in different contexts

    New Auto-Interp
    Negative Logits
    eration
    -0.07
    ugh
    -0.07
    orf
    -0.06
    yte
    -0.06
    bery
    -0.06
    ics
    -0.06
    china
    -0.06
    luv
    -0.06
    sand
    -0.06
     >č↵
    -0.06
    POSITIVE LOGITS
     STDERR
    0.08
    dney
    0.07
    agues
    0.07
    -alist
    0.07
    dust
    0.07
    eg
    0.07
    gili
    0.07
    onnement
    0.07
    ings
    0.07
    един
    0.07
    Act Density 0.012%

    No Known Activations