INDEX
Explanations
phrases related to medical research and health care initiatives
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
449
+0.14
0.8%
427
+0.12
0.7%
15
+0.12
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
15
+0.14
0.05
410
+0.12
0.02
427
+0.12
0.05
Negative Logits
ļ
-2.22
Ĥ
-2.20
ij
-2.19
£
-2.18
ģ
-2.14
¡
-2.09
Ł
-2.02
Īĺ
-2.00
ĸ
-1.95
·
-1.94
POSITIVE LOGITS
]:
1.95
ks
1.60
etin
1.48
unnumbered
1.44
Aug
1.43
eas
1.41
maxima
1.40
urus
1.40
,'"
1.38
extends
1.37
Activations Density 0.871%