INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     qq
    -0.07
     barber
    -0.06
    792
    -0.06
     sharply
    -0.06
     Meghan
    -0.06
     врем
    -0.06
    ');↵↵↵
    -0.06
    gd
    -0.06
     eBooks
    -0.06
    .mark
    -0.06
    POSITIVE LOGITS
     lion
    0.08
     Lion
    0.08
    _initialize
    0.07
    (dialog
    0.07
    /:
    0.07
     sons
    0.07
     Lionel
    0.07
     conquest
    0.07
     POSIX
    0.07
     пози
    0.07
    Act Density 0.003%

    No Known Activations