INDEX
Explanations
terms related to statistical concepts and methodologies
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
98
+0.22
1.3%
156
+0.13
0.7%
10
+0.12
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
10
+0.22
0.09
291
+0.13
0.08
98
+0.12
0.02
Negative Logits
æį®
-1.88
"}](#
-1.66
nai
-1.57
ignon
-1.46
âĶĢâĶĢâĶĢâĶĢ
-1.46
Sov
-1.42
ãĥ¼ãĥī
-1.39
μÎŃ
-1.38
âĶĢâĶĢâĶĢâĶĢâĶĢâĶĢâĶĢâĶĢ
-1.37
ãĥ¥
-1.37
POSITIVE LOGITS
ses
1.94
latter
1.78
aforementioned
1.75
following
1.66
value
1.65
orems
1.59
corresponding
1.59
overall
1.56
respectively
1.55
validity
1.54
Activations Density 0.820%