INDEX
    Explanations

    expressions of personal feelings and opinions

    New Auto-Interp
    Negative Logits
     then
    -0.06
    ibe
    -0.06
    ombres
    -0.06
    oking
    -0.06
     Mixed
    -0.06
     Operating
    -0.06
    uce
    -0.06
     Dudley
    -0.06
    MBOL
    -0.06
     Seek
    -0.05
    POSITIVE LOGITS
    elpers
    0.07
    iglia
    0.07
     doubly
    0.07
    .generated
    0.06
     zwar
    0.06
    olini
    0.06
    stitial
    0.06
    ulin
    0.06
    itin
    0.06
    zure
    0.06
    Act Density 0.030%

    No Known Activations