INDEX
Explanations
references to institutions, organizations, and government entities related to health and environmental policies
Neuron Alignment
Index    Value    % of L₁
285      +0.16    0.9%
271      +0.14    0.8%
181      +0.13    0.8%
Correlated Neurons
Index    P. Corr.    Cos Sim.
285      +0.16       0.08
181      +0.14       0.07
271      +0.13       0.00
Negative Logits
ľĵ    -2.60
¯     -2.54
Īĺ    -2.33
¿     -2.23
½     -2.14
ª     -2.13
º     -2.13
ĨĴ    -2.12
¶     -2.08
ı     -2.05
Positive Logits
Medal         1.89
Republic      1.68
Labor         1.58
Nations       1.52
Amend         1.50
Awards        1.50
Institutes    1.49
Award         1.47
Veterans      1.45
men           1.45
Activations Density: 0.694%