INDEX
    Explanations

    expressions of gratitude and enthusiasm related to teamwork and community involvement

    Negative Logits
    Token          Logit
    "don"          -0.15
    "iston"        -0.14
    "isted"        -0.14
    "λι"           -0.14
    "öh"           -0.14
    "æľī人"        -0.14
    "avigation"    -0.13
    "beer"         -0.13
    "DON"          -0.13
    "etwork"       -0.13
    Positive Logits
    Token          Logit
    " look"        0.71
    " looking"     0.58
    " Look"        0.57
    "look"         0.55
    " looks"       0.53
    "Look"         0.51
    " Looking"     0.50
    " LOOK"        0.48
    ".look"        0.46
    "looking"      0.46
    Act Density 0.123%

    No Known Activations