INDEX
Explanations
mentions of advertisements, especially with strong emotional or impactful connotations
mentions of advertisements
New Auto-Interp
Negative Logits
Wright
-0.67
speculative
-0.66
Myers
-0.62
survival
-0.61
Hou
-0.60
Purg
-0.58
tandem
-0.57
vette
-0.57
sclerosis
-0.57
hold
-0.56
POSITIVE LOGITS
igmatic
1.08
ads
1.04
ync
1.00
oras
0.98
enza
0.96
venture
0.95
apter
0.95
olph
0.94
vertisements
0.89
ules
0.86
Activations Density 0.007%