INDEX
Explanations
mentions of advertisements or ads
mentions of advertisements and related terms
New Auto-Interp
Negative Logits
ĨĴ
-0.74
Carbuncle
-0.72
@#&
-0.68
Jr
-0.67
20439
-0.67
disciplinary
-0.66
bred
-0.66
contingency
-0.66
displayText
-0.65
uador
-0.64
POSITIVE LOGITS
vertising
1.15
verts
1.12
orial
1.03
vertisement
1.00
aired
0.99
ieu
0.95
idas
0.95
strip
0.93
ads
0.92
airing
0.89
Activations Density 0.044%