INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     retired
    -0.08
     ass
    -0.07
     mush
    -0.07
    mr
    -0.07
    lie
    -0.07
     retire
    -0.07
    stead
    -0.07
     mediated
    -0.07
    -0.07
     dismissal
    -0.07
    POSITIVE LOGITS
    .quote
    0.08
    出来
    0.08
     glimps
    0.08
     Rid
    0.08
    0.08
     fucking
    0.08
     quote
    0.08
    _quote
    0.07
     jaz
    0.07
     தெர
    0.07
    Act Density 0.017%

    No Known Activations