INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    METHOD
    -0.74
    tools
    -0.71
    film
    -0.69
    sticks
    -0.67
    henko
    -0.65
    hops
    -0.65
    cloth
    -0.64
    aceutical
    -0.64
    accompan
    -0.63
    biased
    -0.63
    POSITIVE LOGITS
    rium
    1.25
     dusk
    0.90
    las
    0.90
     least
    0.89
     Mile
    0.87
     Sunrise
    0.82
    yp
    0.81
     sunset
    0.80
     weekends
    0.80
     Fort
    0.79
    Act Density 0.099%

    No Known Activations