INDEX
    Explanations

    advertisement content within the text

    instances of advertisements

    Negative Logits
    Token             Logit
    "ties"            -0.69
    "Ͻ"               -0.65
    " makeshift"      -0.63
    " perspect"       -0.63
    "mate"            -0.62
    "graded"          -0.60
    " retri"          -0.60
    "fulness"         -0.58
    " wound"          -0.58
    " contingent"     -0.58
    Positive Logits
    Token             Logit
    " Continue"       1.00
    " Advertisement"  0.97
    "advertisement"   0.80
    "Advertisement"   0.72
    "Skip"            0.71
    "usercontent"     0.70
    "Credit"          0.68
    "ieu"             0.68
    "ADVERTISEMENT"   0.67
    "sburg"           0.67
    Activation Density: 0.026%

    No Known Activations