Explanations
commands and code snippets within text
Neuron Alignment
Index   Value   % of L₁
1145    +0.17   0.8%
1127    +0.15   0.7%
1983    +0.14   0.7%
Correlated Neurons
Index   Pearson Corr.   Cosine Sim.
981     +0.17           0.05
1145    +0.15           0.04
1177    +0.14           0.02
Negative Logits
Token        Value
COP          -0.51
cop          -0.47
—            -0.47
شن           -0.44
Sk           -0.43
COP          -0.43
Sik          -0.42
DataLoader   -0.42
COPD         -0.42
voiture      -0.42
Positive Logits
Token    Value
ļ        0.83
paff     0.78
dispen   0.78
alkoh    0.77
hek      0.75
kram     0.75
mī       0.74
ftu      0.74
tille    0.73
mef      0.73
Activations Density 0.294%