INDEX
Explanations
keywords related to public opinion and attitudes towards societal issues
Neuron Alignment
Index    Value    % of L₁
1253     +0.11    0.3%
972      +0.08    0.2%
184      +0.08    0.2%
Correlated Neurons
Index    P. Corr.    Cos Sim.
728      +0.11       0.04
972      +0.08       0.04
887      +0.08       0.04
Negative Logits
affor        -1.38
swarovski    -1.38
depic        -1.27
perfet       -1.27
encomp       -1.26
eiffel       -1.26
casio        -1.26
maneu        -1.25
philanth     -1.25
accla        -1.25
Positive Logits
respondents       0.83
+#+               0.72
majority          0.68
survey            0.65
WriteAttribute    0.64
surveyed          0.63
poll              0.62
camy              0.61
respondent        0.60
percentage        0.59
Activations Density: 0.257%