INDEX
    Explanations

    the word "poet" and its various forms and contexts

    Negative Logits
    Token      Logit
    "ument"    -0.17
    "kan"      -0.17
    "dek"      -0.17
    "bek"      -0.17
    "ui"       -0.16
    "desk"     -0.15
    "wij"      -0.15
    "uppy"     -0.15
    "innen"    -0.15
    "çĸĨ"      -0.15
    Positive Logits
    Token        Logit
    " Po"        0.20
    "iesz"       0.20
    "Po"         0.20
    "isson"      0.19
    " po"        0.19
    "ached"      0.17
    "entially"   0.17
    "isons"      0.17
    "ehler"      0.17
    "etics"      0.16
    Activation Density: 0.017%

    No Known Activations