INDEX
    Explanations

    phrases related to things that have been demonstrated or confirmed as factual or effective

    phrases emphasizing evidence of reliability or established success

    New Auto-Interp
    Negative Logits
    adish
    -0.73
     newsletters
    -0.68
    ifle
    -0.67
    idays
    -0.66
     sshd
    -0.64
    iewicz
    -0.64
    letal
    -0.64
    umbn
    -0.63
    hhh
    -0.63
    eeper
    -0.63
    POSITIVE LOGITS
    ãĥ¼ãĥĨ
    0.91
     proven
    0.87
    iary
    0.84
    uable
    0.81
    \\\\\\\\
    0.79
    س
    0.78
    ance
    0.77
    د
    0.73
    ãĤ¤ãĥĪ
    0.72
     debunked
    0.71
    Act Density 0.019%

    No Known Activations