INDEX
    Explanations

    points or arguments being made in a text

    references to key points being made in discussions or arguments

    New Auto-Interp
    Negative Logits
    DAQ
    -0.86
    reditary
    -0.84
    uthor
    -0.84
     destro
    -0.82
     notor
    -0.80
    eatures
    -0.78
    apons
    -0.77
    eco
    -0.75
    undai
    -0.75
    emale
    -0.74
    POSITIVE LOGITS
    lessly
    0.93
    point
    0.89
    points
    0.88
    blank
    0.82
    posted
    0.82
    forward
    0.82
    lessness
    0.77
     deduction
    0.75
    posts
    0.75
    iasis
    0.74
    Act Density 0.033%

    No Known Activations