INDEX
    Explanations

    closing punctuation marks, specifically brackets and parentheses

    Negative Logits
    Against        -0.74
    Nap            -0.69
    con            -0.67
    witch          -0.63
    ORE            -0.61
    Constructed    -0.60
    "-             -0.60
    pop            -0.60
    Prem           -0.60
    "…             -0.59
    Positive Logits
    ��             0.80
    xual           0.74
     Samar         0.74
    ��             0.68
    nesday         0.66
    ernel          0.65
    ���            0.65
     Volunteers    0.63
    iggs           0.63
    retty          0.62
    Activation Density: 0.141%

    No Known Activations