INDEX
    Explanations

    positive adjectives describing something favorable or beneficial

    phrases that express positive evaluations or affirmations regarding various subjects

    Negative Logits
    Token         Logit
    "racuse"      -0.79
    "ategory"     -0.79
    "opers"       -0.73
    "ipient"      -0.73
    "hyde"        -0.71
    "ancies"      -0.70
    "eters"       -0.69
    "olson"       -0.67
    "ppe"         -0.66
    "ardy"        -0.66
    Positive Logits
    Token             Logit
    " enough"         1.05
    "enough"          0.88
    "bye"             0.81
    " fodder"         0.80
    " consolation"    0.78
    " synergy"        0.78
    " news"           0.77
    " surpr"          0.77
    " luck"           0.76
    " Enough"         0.76
    Act Density 0.114%

    No Known Activations