INDEX
Explanations
publication-related information like DOIs, URLs, and publication details
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.17
0.6%
1018
+0.09
0.3%
75
+0.08
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1927
+0.17
0.03
1213
+0.09
0.03
685
+0.08
0.02
Negative Logits
/**
-0.71
ⓧ
-0.71
///**
-0.68
"..\..\..\
-0.63
setObject
-0.61
="#">
-0.60
AddField
-0.60
||}
-0.60
addGroup
-0.60
}{||-0.59
POSITIVE LOGITS
affor
1.45
véhic
1.44
impra
1.37
pleins
1.34
cushi
1.32
accla
1.30
strick
1.29
rafra
1.29
vété
1.28
tupperware
1.28
Activations Density 0.036%