INDEX
    Explanations

    phrases related to gratitude and appreciation

    references to a collective or generalized group

    New Auto-Interp
    Negative Logits
     Kamp
    -0.76
    illac
    -0.66
    abwe
    -0.62
    yip
    -0.62
    pu
    -0.59
    potion
    -0.59
    fman
    -0.59
     Presence
    -0.58
    kamp
    -0.57
     Peninsula
    -0.57
    POSITIVE LOGITS
     sorts
    1.19
     kinds
    1.17
    ocating
    1.16
    igators
    0.95
    iances
    0.94
    usions
    0.94
    igator
    0.89
    uv
    0.84
    ogene
    0.83
    uding
    0.83
    Act Density 0.114%

    No Known Activations