Explanations
advertisements within a text
instances of advertisements
Negative Logits
vain          -0.78
contingency   -0.77
bunk          -0.75
clus          -0.74
sacr          -0.73
explor        -0.70
pred          -0.70
planned       -0.70
marsh         -0.69
corpor        -0.68
Positive Logits
Advertisement   1.37
Advertisement   1.12
advertisement   1.01
});             0.93
taboola         0.90
ertodd          0.90
advertising     0.83
Images          0.83
Credit          0.82
Continue        0.80
Activations Density 0.013%