INDEX
    Explanations

    phrases or sentences ending with punctuation such as a period or comma, followed by a high-emotion word

    punctuation indicating the end of statements or sentences

    Negative Logits
    iden     -0.69
    ighed    -0.67
    ãĥĭ      -0.65
    ®        -0.64
    IRO      -0.62
    ==       -0.61
    umat     -0.60
    NRS      -0.59
    Hub      -0.59
    Frame    -0.58
    Positive Logits
    Interstitial    1.20
    -'              0.86
     ',             0.84
    taboola         0.84
    .'              0.82
    emouth          0.80
    Cause           0.80
    til             0.78
     '.             0.78
    tis             0.76
    Act Density 0.036%

    No Known Activations