INDEX
Explanations
verbs related to questioning or pushing back on established ideas
instances of the word "challenge" and its variations, indicating discussions about contesting ideas or authority
New Auto-Interp
Negative Logits
abet
-0.81
ng
-0.71
ufact
-0.68
]}
-0.66
ther
-0.66
ops
-0.65
··
-0.65
agh
-0.64
oped
-0.62
holm
-0.61
POSITIVE LOGITS
assumptions
1.15
stereotypes
1.12
precon
1.07
orthodoxy
0.97
misconceptions
0.96
perceptions
0.94
beliefs
0.93
myths
0.88
belief
0.83
assertions
0.82
Activations Density 0.168%