INDEX
    Explanations

    expressions of appreciation or gratitude

    expressions of gratitude and appreciation

    New Auto-Interp
    Negative Logits
    infect
    -0.87
    buster
    -0.83
    rooms
    -0.74
    prep
    -0.74
    soDeliveryDate
    -0.71
    metal
    -0.71
    shr
    -0.71
    ridden
    -0.71
    ãĥĥãĥī
    -0.68
    idem
    -0.68
    POSITIVE LOGITS
    ĸļ
    0.98
     appreciation
    0.85
     contributions
    0.78
     compliments
    0.76
    ably
    0.76
    ments
    0.74
     appreciate
    0.73
     advances
    0.72
     gifts
    0.71
    ¿½
    0.71
    Act Density 0.030%

    No Known Activations