INDEX
Explanations
mentions of advertisements
terms related to advertising and promotional content
New Auto-Interp
Negative Logits
uckle
-0.80
steen
-0.74
20439
-0.70
kson
-0.66
ologically
-0.66
come
-0.65
Morrow
-0.64
borg
-0.63
izers
-0.60
opez
-0.60
POSITIVE LOGITS
vertising
1.07
Advertisement
0.99
culosis
0.86
allery
0.81
VERTISEMENT
0.80
advertisement
0.76
FontSize
0.75
eering
0.74
agascar
0.74
elaide
0.74
Activations Density 0.020%