INDEX
    Explanations

    keywords related to giving advice or suggestions

    phrases that offer advice or recommendations

    New Auto-Interp
    Negative Logits
    ufact
    -0.75
    eals
    -0.69
    ords
    -0.68
    ruciating
    -0.67
     Palest
    -0.65
     Cav
    -0.65
    lihood
    -0.65
    minist
    -0.64
    ipment
    -0.63
    yss
    -0.61
    POSITIVE LOGITS
     tips
    1.24
     tip
    1.09
    tip
    1.07
    Tip
    1.05
    tips
    1.02
    Tips
    1.01
     Tips
    0.92
    heet
    0.92
     tipping
    0.86
     Tip
    0.77
    Act Density 0.013%

    No Known Activations