INDEX
Explanations
elements and structures moving through physical or metaphorical constructions
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
156
+0.18
1.0%
159
+0.18
1.0%
34
+0.13
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
159
+0.18
0.13
73
+0.18
0.05
277
+0.13
0.09
Negative Logits
olars
-1.46
quo
-1.41
urrent
-1.38
hostage
-1.30
alities
-1.30
onomy
-1.29
acted
-1.28
OSE
-1.27
transmitted
-1.27
.^\[[@
-1.23
POSITIVE LOGITS
Studio
1.56
%{1.50
fres
1.44
studio
1.37
ew
1.36
autom
1.34
closures
1.33
pipelines
1.31
Levi
1.31
engineers
1.31
Activations Density 5.182%