Explanations
terms related to regulatory policies and awards in the context of performance recognition
Neuron Alignment
Index   Value   % of L₁
204     +0.15   0.9%
159     +0.14   0.8%
122     +0.14   0.8%
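The "% of L₁" column is not defined on this page. A minimal sketch of one plausible reading, assuming it is each entry's absolute value divided by the L₁ norm (sum of absolute values) of the whole vector. The function name `l1_share` and the toy vector are hypothetical and are not the weights behind the table.

```python
def l1_share(values):
    """Return each entry's share of the vector's L1 norm, in percent.

    Assumption: "% of L1" means 100 * |v_i| / sum_j |v_j|.
    """
    l1_norm = sum(abs(v) for v in values)
    if l1_norm == 0:
        # An all-zero vector has no meaningful shares.
        return [0.0 for _ in values]
    return [100.0 * abs(v) / l1_norm for v in values]


# Hypothetical toy vector, chosen so the arithmetic is easy to check by hand.
weights = [0.5, -0.25, 0.25]
shares = l1_share(weights)
print(shares)
```

Under this reading, a value of +0.15 at 0.9% would imply an L₁ norm of roughly 0.15 / 0.009 ≈ 17 for the full vector, which is roughly consistent with the other two rows given rounding.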
Correlated Neurons
Index   P. Corr.   Cos Sim.
122     +0.15      0.10
204     +0.14      0.14
233     +0.14      0.08
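The two columns above report different similarity measures. As a hedged illustration of how they differ, the sketch below computes both for a pair of plain Python lists, assuming "P. Corr." is the Pearson correlation. Which vectors the page actually compares (activations over tokens, weight directions, or something else) is not stated here, so the inputs are hypothetical toy data.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


def pearson_correlation(a, b):
    """Pearson correlation: cosine similarity after mean-centering both vectors."""
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    return cosine_similarity([x - mean_a for x in a], [y - mean_b for y in b])


# Hypothetical toy data: b is a shifted copy of a.
a = [1.0, 2.0, 3.0]
b = [3.0, 4.0, 5.0]
cos = cosine_similarity(a, b)
corr = pearson_correlation(a, b)
print(cos, corr)
```

Because Pearson correlation subtracts the mean first, a constant offset changes the cosine similarity but not the correlation, which is one reason the two columns need not agree in ranking.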
Negative Logits
jurisdiction   -1.44
plete          -1.35
inheritance    -1.35
@@             -1.32
atinib         -1.31
ies            -1.30
plicial        -1.29
chance         -1.29
<!             -1.26
estate         -1.26
Positive Logits
ĻĤ   1.89
ľĵ   1.88
Ŀ    1.86
ĸ    1.84
ľ    1.81
¤    1.74
Ł    1.72
İ    1.71
º    1.71
ŀ    1.71
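The positive-logit tokens above look like the printable stand-ins that byte-level BPE vocabularies use for raw bytes. The page does not say which tokenizer is in use, so the sketch below is an assumption: it rebuilds the byte-to-character table used by GPT-2-style byte-level BPE and inverts it to recover the bytes behind a token string. The helper name `token_to_bytes` is hypothetical.

```python
def bytes_to_unicode():
    """Byte -> printable character table used by GPT-2-style byte-level BPE.

    Printable bytes map to themselves; the remaining bytes are assigned
    code points starting at 256, in increasing byte order.
    """
    byte_values = (
        list(range(33, 127)) + list(range(161, 173)) + list(range(174, 256))
    )
    code_points = byte_values[:]
    offset = 0
    for b in range(256):
        if b not in byte_values:
            byte_values.append(b)
            code_points.append(256 + offset)
            offset += 1
    return {b: chr(c) for b, c in zip(byte_values, code_points)}


# Inverse table: display character -> raw byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def token_to_bytes(token):
    """Recover the raw bytes behind a byte-level BPE token string."""
    return bytes(UNICODE_TO_BYTE[ch] for ch in token)


# Assumes the model uses this byte mapping; not confirmed by the page.
raw = token_to_bytes("ĻĤ")
print(raw.hex())
```

If the assumption holds, tokens such as these stand for fragments of multi-byte UTF-8 sequences rather than standalone characters, so they are best read as byte patterns and not as the Latin letters they display as.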
Activations Density: 4.903%