    Explanations

    instances of the word "prompt" or its variants

    Negative Logits
    Token       Logit
    "igi"       -0.17
    "iesel"     -0.16
    "wi"        -0.15
    "ERN"       -0.14
    "hou"       -0.14
    "apo"       -0.14
    " bre"      -0.14
    "lect"      -0.14
    " Bold"     -0.14
    "lector"    -0.14
    Positive Logits
    Token           Logit
    "æĿIJ"           0.16
    "conditions"    0.15
    "oron"          0.15
    "zilla"         0.15
    ".generated"    0.14
    "stit"          0.14
    "oko"           0.14
    "blem"          0.14
    "lém"           0.14
    "sticks"        0.14
    Activation Density: 0.009%

    No Known Activations