INDEX
Explanations
statements or arguments being emphasized or highlighted in text
Neuron Alignment
Index   Value   % of L₁
1376    +0.15   0.6%
1085    +0.13   0.5%
1052    +0.13   0.5%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1052    +0.15      0.05
1376    +0.13      0.05
1085    +0.13      0.04
Negative Logits
õ               -0.45
roh             -0.43
glise           -0.43
Loire           -0.41
braio           -0.41
Portály         -0.41
MatIconModule   -0.40
gida            -0.40
Crew            -0.40
dè              -0.40
Positive Logits
point    1.32
point    1.31
POINT    1.29
Point    1.22
Point    1.20
points   1.19
points   1.18
POINT    1.14
POINTS   1.11
Points   1.08
Activations Density: 0.120%