INDEX
    Explanations
    Negative Logits
     takeaway       -0.08
     Jamaican       -0.07
    _prime          -0.07
     prog           -0.07
     coc            -0.07
    (cp             -0.07
    lios            -0.07
    perc            -0.07
    <|endoftext|>   -0.07
     Pry            -0.07
    Positive Logits
    ิโน                                   0.08
     missen                              0.08
     unfamiliar                          0.08
     irreversible                        0.08
     pesado                              0.08
     قط                                  0.08
     ignorance                           0.07
     //--------------------------------  0.07
    Ignored                              0.07
     смерть                              0.07
    Act Density 0.000%

    No Known Activations