INDEX
    Explanations

    phrases related to reader engagement and the encouragement to interact with content

    New Auto-Interp
    Negative Logits
    jab
    -0.16
    ufen
    -0.16
     matchmaking
    -0.15
    offset
    -0.15
    owan
    -0.14
     Latter
    -0.14
    jeta
    -0.14
    idente
    -0.14
    eldorf
    -0.14
    eden
    -0.14
    POSITIVE LOGITS
    Gün
    0.17
    ãĤ¿ãĥ³
    0.14
    ONGL
    0.14
    ائÙģ
    0.14
     поÑħож
    0.14
    pron
    0.14
    iaux
    0.14
    vise
    0.13
    å¾Ħ
    0.13
    Far
    0.13
    Act Density 0.029%

    No Known Activations