INDEX
Explanations
phrases related to widespread impact or acceptance
the term "widespread" and related consistent mentions in various contexts
New Auto-Interp
Negative Logits
estamp
-0.81
men
-0.75
pent
-0.74
ttle
-0.73
udging
-0.71
eters
-0.71
woman
-0.70
acters
-0.69
asus
-0.69
=-=-
-0.69
POSITIVE LOGITS
adoption
1.03
misconception
1.02
occurrence
0.93
acceptance
0.91
misinformation
0.90
belief
0.89
dissemination
0.86
condemnation
0.86
misconceptions
0.85
widespread
0.84
Activations Density 0.063%