INDEX
    Explanations

    phrases indicating superiority or excellence

    phrases indicating the concept of "best" or superiority

    New Auto-Interp
    Negative Logits
    chy
    -0.66
    ngth
    -0.65
    idon
    -0.63
     Emer
    -0.62
    cking
    -0.61
    kers
    -0.61
    ascade
    -0.60
    IGHTS
    -0.59
     Reloaded
    -0.59
    kefeller
    -0.58
    POSITIVE LOGITS
    seller
    1.20
    iary
    1.09
    sell
    1.09
     suited
    1.08
    ow
    1.08
    owing
    1.07
    ows
    1.07
    iaries
    1.06
    ower
    1.06
    ial
    0.94
    Act Density 0.051%

    No Known Activations