INDEX
    Explanations
    NEGATIVE LOGITS
     TestData       -0.07
    리즈              -0.07
     secondo        -0.06
    illis           -0.06
    _markers        -0.06
    inkel           -0.06
    :white          -0.06
    achelor         -0.06
    redict          -0.06
    -checkbox       -0.06
    POSITIVE LOGITS
     vegan          0.14
     Vegan          0.12
    GAN             0.07
                    0.07
    xic             0.06
     Evan           0.06
     cuda           0.06
    916             0.06
     Professional   0.06
    egan            0.06
    Act Density 0.001%

    No Known Activations