INDEX
Explanations
numbers related to historical events and time periods
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
537
+0.13
0.5%
1124
+0.12
0.4%
776
+0.12
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
776
+0.13
0.06
537
+0.12
0.04
75
+0.12
0.03
Negative Logits
rcParams
-0.65
alkoh
-0.57
pü
-0.55
ModelForm
-0.54
StartPosition
-0.54
PushButton
-0.53
WebControls
-0.53
Matchers
-0.53
Descripció
-0.53
optik
-0.53
POSITIVE LOGITS
intersper
1.03
overcrow
0.95
apprehen
0.92
shenan
0.84
underval
0.83
stickied
0.81
unspeak
0.80
indestru
0.78
maneu
0.78
downvote
0.78
Activations Density 0.158%