INDEX
Explanations
advertisement content within the text
instances of advertisements
New Auto-Interp
Negative Logits
ties
-0.69
Ͻ
-0.65
makeshift
-0.63
perspect
-0.63
mate
-0.62
graded
-0.60
retri
-0.60
fulness
-0.58
wound
-0.58
contingent
-0.58
POSITIVE LOGITS
Continue
1.00
Advertisement
0.97
advertisement
0.80
Advertisement
0.72
Skip
0.71
usercontent
0.70
Credit
0.68
ieu
0.68
ADVERTISEMENT
0.67
sburg
0.67
Activations Density 0.026%