INDEX
Explanations
references to medical conditions or physical injuries
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
196
+0.14
0.5%
1472
+0.12
0.5%
468
+0.12
0.5%
Correlated Neurons
Index
P. Corr.
Cos Sim.
196
+0.14
0.03
468
+0.12
0.03
1363
+0.12
0.03
Negative Logits
reluct
-0.92
suscep
-0.85
disreg
-0.83
erad
-0.82
increa
-0.79
nicolas
-0.78
inconce
-0.77
volunte
-0.76
affor
-0.76
maneu
-0.76
POSITIVE LOGITS
injury
1.30
injuries
1.21
Injury
1.16
injured
1.08
Injuries
1.05
Injury
1.05
injury
1.05
injured
0.94
Injuries
0.91
INJ
0.90
Activations Density 0.063%