INDEX
Explanations
terms related to recipe instructions
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
168
+0.09
0.3%
1677
+0.09
0.3%
1052
+0.09
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1120
+0.09
0.03
1052
+0.09
0.02
184
+0.09
0.01
Negative Logits
<bos>
-0.90
fix
-0.67
por
-0.61
earn
-0.61
private
-0.60
function
-0.60
put
-0.60
off
-0.59
func
-0.58
go
-0.58
POSITIVE LOGITS
Campbell
1.88
Campbell
1.71
ftu
1.64
depic
1.61
effe
1.61
saar
1.60
maneu
1.58
CAMPBELL
1.57
stockholm
1.55
vns
1.53
Activations Density 0.207%