INDEX
    Explanations

    references to the concept of 'good'

    New Auto-Interp
    Negative Logits
    UGE
    -0.77
     Constantin
    -0.74
    BuyableInstoreAndOnline
    -0.73
     Ago
    -0.71
     MIA
    -0.69
    USE
    -0.68
    ASED
    -0.66
    chal
    -0.66
    UNCH
    -0.66
    asper
    -0.66
    POSITIVE LOGITS
    ood
    1.21
    edly
    0.93
    ed
    0.92
    iership
    0.91
    lers
    0.90
    rill
    0.89
    les
    0.88
    ividual
    0.87
    ler
    0.87
    skin
    0.86
    Act Density 0.016%

    No Known Activations