INDEX
Explanations
advertisements within content
instances of advertisements in the text
New Auto-Interp
Negative Logits
stood
-0.67
ties
-0.66
clus
-0.63
fulness
-0.62
ãĥ´
-0.61
wound
-0.61
mate
-0.61
perspect
-0.60
omission
-0.60
accessibility
-0.59
POSITIVE LOGITS
Continue
0.93
Advertisement
0.91
advertisement
0.84
Credit
0.75
Advertisement
0.74
Skip
0.71
usercontent
0.70
COURT
0.68
credit
0.68
sburg
0.66
Activations Density 0.021%