INDEX
    Explanations

    expressions of appreciation or gratitude towards content creators

    New Auto-Interp
    Negative Logits
     Alive
    -0.16
    bor
    -0.15
    ogl
    -0.15
    ÙĪØ·
    -0.15
    rippling
    -0.13
    Alive
    -0.13
     Sans
    -0.13
    اÙĦØ¥ÙĨجÙĦÙĬزÙĬØ©
    -0.13
    orns
    -0.13
    'field
    -0.13
    POSITIVE LOGITS
     informational
    0.17
     article
    0.17
     information
    0.16
    áu
    0.15
    nÃŃ
    0.15
     posting
    0.15
     share
    0.15
     informative
    0.15
     Nice
    0.14
     Great
    0.14
    Act Density 0.030%

    No Known Activations