    Explanations

    specific references or terms in text

    phrases or terms that indicate reference or citation

    Negative Logits
    Token            Logit
    " captcha"       -0.66
    "azaki"          -0.63
    " tomorrow"      -0.60
    " every"         -0.60
    " tonight"       -0.54
    " marrow"        -0.54
    " free"          -0.53
    "iven"           -0.52
    " worthwhile"    -0.51
    " ju"            -0.51
    Positive Logits
    Token            Logit
    " refers"        3.60
    " refer"         2.07
    " denotes"       2.04
    " describes"     1.93
    " referred"      1.81
    " relates"       1.78
    " implies"       1.75
    " referring"     1.67
    " specifies"     1.66
    " translates"    1.65
    Activation Density: 0.013%

    No Known Activations