INDEX
    Explanations

    dialogue and quotations in texts

    Negative Logits
    iw
    -0.17
    lef
    -0.15
    RIES
    -0.14
    overy
    -0.14
     Rib
    -0.14
    iris
    -0.14
     Oswald
    -0.14
    usch
    -0.13
    SSI
    -0.13
    annis
    -0.13
    Positive Logits
    é¹
    0.14
    θεν
    0.13
    ék
    0.13
     buckle
    0.13
    ople
    0.13
     Mayer
    0.13
    ัมà¸ŀ
    0.13
     loose
    0.13
    ister
    0.13
    iges
    0.13
    Activation Density: 0.237%

    No Known Activations