Explanations
instances of the term "advertising" and related concepts
Negative Logits
dan            -0.17
風             -0.15
advertised     -0.15
ay             -0.14
phant          -0.14
ifications     -0.14
votes          -0.14
ton            -0.14
advertise      -0.14
-door          -0.14
Positive Logits
hoc            0.25
/mark          0.21
campaigns      0.21
/prom          0.20
orial          0.20
orption        0.20
hoc            0.19
aption         0.19
jectives       0.19
-supported     0.18
Activations Density: 0.019%