    Explanations

    phrases expressing gratitude and appreciation

    Following "the," often leading to positive experiences

    positive experiences or opportunities

    Negative Logits
    Token             Logit
    "aarrggbb"        -0.74
    " Majefty"        -0.72
    "AddTagHelper"    -0.72
    " Theſe"          -0.71
    " ſche"           -0.70
    " Conſ"           -0.70
    " becauſe"        -0.70
    " ſtate"          -0.70
    "uxxxx"           -0.69
    " ſtre"           -0.68
    Positive Logits
    Token             Logit
    " pleasure"       2.15
    "pleasure"        1.86
    " privilege"      1.71
    " honor"          1.71
    " honour"         1.52
    " Pleasure"       1.50
    "privilege"       1.40
    " honored"        1.35
    "honor"           1.32
    " plaisir"        1.28
    Activation Density: 0.164%

    No Known Activations