INDEX
Explanations
phrases related to advertisements
instances of advertisements
New Auto-Interp
Negative Logits
erva
-0.70
»Ĵ
-0.69
quartered
-0.68
ĪĴ
-0.66
stood
-0.66
borg
-0.64
folk
-0.63
cius
-0.63
roots
-0.62
ciplinary
-0.61
POSITIVE LOGITS
Thumbnails
0.95
Advertisement
0.79
VERTISEMENT
0.76
Interstitial
0.72
Skip
0.70
advertisement
0.69
advertisement
0.69
ļéĨĴ
0.68
toggle
0.66
Transcript
0.65
Activations Density 0.008%