INDEX
    Explanations

    repetitions of the word "once."

    New Auto-Interp
    Negative Logits
    -Semit
    -0.16
    lrt
    -0.16
    wdx
    -0.16
     gratuiti
    -0.15
    linger
    -0.15
    ltk
    -0.15
    arken
    -0.15
    .esp
    -0.15
    uctions
    -0.14
    acios
    -0.14
    POSITIVE LOGITS
     gain
    0.34
    gain
    0.29
     Gain
    0.28
     ag
    0.27
    Gain
    0.25
    ag
    0.23
     AN
    0.22
    _gain
    0.21
     against
    0.20
    -ag
    0.19
    Act Density 0.011%

    No Known Activations