INDEX
Explanations
numerical data or statistics
Neuron Alignment
Index   Value   % of L₁
128     +0.14   0.8%
125     +0.13   0.7%
352     +0.11   0.6%
Correlated Neurons
Index   P. Corr.   Cos Sim.
133     +0.14      0.03
128     +0.13      0.02
69      +0.11      0.04
Negative Logits
idi       -1.62
ridges    -1.60
eness     -1.56
fork      -1.54
upside    -1.54
river     -1.47
line      -1.42
éd        -1.42
tighter   -1.40
chain     -1.40
Positive Logits
00        1.59
cember    1.52
ffen      1.50
ophers    1.44
eland     1.43
venth     1.42
еÐ        1.41
adier     1.39
ratings   1.38
xture     1.37
Activations Density: 0.208%