INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    Rename
    -0.08
     charm
    -0.08
    .rename
    -0.08
     Carta
    -0.08
     Rename
    -0.08
    rename
    -0.08
     encant
    -0.08
     importantes
    -0.07
     Blatt
    -0.07
     Rent
    -0.07
    POSITIVE LOGITS
    D
    0.08
     FG
    0.07
    0.07
    еди
    0.07
    Ideas
    0.07
     fes
    0.07
    >D
    0.07
    (Post
    0.07
     jin
    0.06
    (F
    0.06
    Act Density 0.004%

    No Known Activations