INDEX
Explanations
references to different types of utensils and kitchen supplies
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
313
+0.19
1.0%
1296
+0.18
1.0%
1492
+0.15
0.8%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1363
+0.19
0.05
1741
+0.18
-0.01
478
+0.15
0.02
Negative Logits
<bos>
-1.02
提
-0.60
bot
-0.59
bo
-0.58
Đặc
-0.56
pub
-0.56
бо
-0.55
栗
-0.54
bo
-0.54
case
-0.54
POSITIVE LOGITS
Khart
1.09
ftu
1.04
Juf
1.02
unwarran
1.01
centrif
1.00
paradiso
1.00
tranf
1.00
anhyd
0.99
fta
0.99
oleo
0.99
Activations Density 0.847%