    Negative Logits
    Token         Logit
    "merce"       -0.89
    "doms"        -0.80
    "urities"     -0.68
    "uli"         -0.67
    "IFA"         -0.66
    "tale"        -0.66
    "ahime"       -0.66
    "ancial"      -0.65
    "roe"         -0.63
    "yrinth"      -0.63
    Positive Logits
    Token         Logit
    " bottles"    1.21
    " bottle"     1.14
    " Bottle"     1.01
    "Bott"        0.94
    " Bott"       0.91
    " opener"     0.86
    " refill"     0.84
    " cans"       0.83
    " labelled"   0.83
    " vodka"      0.80
    Activation Density: 0.021%

    No Known Activations