INDEX
Explanations
references to baked goods and their preparation
Neuron Alignment
Index   Value   % of L₁
50      +0.32   1.1%
736     +0.09   0.3%
394     +0.09   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
736     +0.32      0.12
823     +0.09      0.06
394     +0.09      0.08
Negative Logits
<bos>     -1.56
/***      -0.98
ⓧ         -0.95
meras     -0.93
ideolog   -0.91
solidar   -0.90
///**     -0.88
ló        -0.87
pól       -0.84
dras      -0.81
Positive Logits
shenan     1.17
indestru   1.15
maneu      1.10
reluct     1.02
fortn      0.94
disagre    0.93
apprehen   0.92
gaily      0.91
wherea     0.91
Middles    0.90
Activations Density: 1.153%