INDEX
    Explanations

    phrases of contradiction or opposition

    Negative Logits
    " Contains"       -0.74
    "Released"        -0.67
    "Reviewed"        -0.66
    "Requires"        -0.65
    "Synopsis"        -0.64
    "PDF"             -0.61
    "CLAIM"           -0.60
    " Released"       -0.60
    "Introduction"    -0.60
    "ublished"        -0.59

    Positive Logits
    " yeah"            1.22
    " secondly"        1.20
    " everybody"       1.09
    "entimes"          1.04
    " somebody"        1.04
    " nobody"          1.02
    " we"              1.01
    " I"               1.00
    " luckily"         0.99
    " ["               0.99

    Act Density 0.303%

    No Known Activations