INDEX
Explanations
words related to promoting or advertising something
terms associated with promotional language or marketing
New Auto-Interp
Negative Logits
Ct
-0.69
ustration
-0.66
emort
-0.66
Pen
-0.65
reckoning
-0.65
pond
-0.60
alde
-0.60
bj
-0.58
Pg
-0.58
arf
-0.57
POSITIVE LOGITS
virtues
1.18
successes
0.87
endorsements
0.86
superiority
0.83
stunts
0.79
benefits
0.77
praises
0.77
products
0.77
promises
0.77
credentials
0.77
Activations Density 0.206%