Explanations
advertisements
instances of advertisements
Negative Logits
Token          Logit
clus           -0.78
ties           -0.71
stood          -0.70
mate           -0.68
contingency    -0.66
sacr           -0.66
isolation      -0.66
cluded         -0.65
wald           -0.64
perspect       -0.63
Positive Logits
Token          Logit
Advertisement  0.93
Continue       0.89
Advertisement  0.86
advertisement  0.85
Credit         0.83
Skip           0.76
Images         0.73
credit         0.72
Thumbnails     0.69
Image          0.67
Activations Density: 0.022%