INDEX
Explanations
references to specific values and statistics
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
187
+0.14
0.8%
156
+0.14
0.8%
275
+0.10
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
41
+0.14
0.03
187
+0.14
0.02
275
+0.10
0.01
Negative Logits
optera
-1.66
characterised
-1.52
cause
-1.47
arrest
-1.47
prescribed
-1.46
Exactly
-1.44
blogger
-1.44
onuclear
-1.41
disambiguation
-1.40
fault
-1.40
POSITIVE LOGITS
erie
1.87
derr
1.86
oir
1.83
uet
1.81
dez
1.74
ved
1.71
asek
1.66
adin
1.64
inux
1.61
gus
1.60
Activations Density 0.150%