INDEX
Explanations
terms related to education, training, and workshops
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
411
+0.18
0.7%
1806
+0.15
0.6%
1870
+0.14
0.5%
Correlated Neurons
Index
P. Corr.
Cos Sim.
411
+0.18
0.04
1806
+0.15
0.04
559
+0.14
0.03
Negative Logits
nicolas
-0.88
vété
-0.81
alberto
-0.80
maneu
-0.79
reluct
-0.77
accla
-0.77
milf
-0.75
yves
-0.75
sergio
-0.74
intersper
-0.74
POSITIVE LOGITS
training
1.31
Training
1.21
training
1.20
Training
1.19
train
1.12
train
1.10
TRAINING
1.09
Train
1.06
Train
1.02
trains
1.01
Activations Density 0.076%