INDEX
Explanations
references to liquid substances or their properties
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
156
+0.22
1.3%
304
+0.14
0.8%
495
+0.12
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
304
+0.22
0.02
495
+0.14
0.01
18
+0.12
0.01
Negative Logits
cess
-1.75
ours
-1.61
founder
-1.60
rior
-1.56
passer
-1.47
nae
-1.44
ÏħÏĦ
-1.42
proud
-1.42
TRODUCTION
-1.42
Ñĸд
-1.38
POSITIVE LOGITS
Ĵ
2.07
ľ
1.97
ģ
1.89
itation
1.89
²
1.88
ī
1.87
chromatography
1.87
¿½
1.79
ħ
1.79
»
1.77
Activations Density 0.066%