INDEX
Explanations
terms related to advocating or endorsing different causes or products
expressions related to promotion and advocacy
Negative Logits
psons    -0.73
ft       -0.72
plets    -0.67
gging    -0.66
Reserv   -0.66
Manor    -0.64
dra      -0.63
vette    -0.63
thing    -0.63
dal      -0.62
Positive Logits
promoting   1.05
promotes    0.99
promotion   0.99
promote     0.92
dissemin    0.88
ourage      0.83
Promotion   0.82
promoted    0.78
incent      0.74
ãħĭ         0.74
Activations Density: 0.018%