INDEX
Explanations
references to the brain and its conditions
Neuron Alignment
Index  Value  % of L₁
156    +0.24  1.4%
188    +0.14  0.8%
71     +0.14  0.8%
Correlated Neurons
Index  P. Corr.  Cos Sim.
188    +0.24     0.01
446    +0.14     0.01
299    +0.14     0.01
Negative Logits
Token    Logit
ment     -1.74
ments    -1.67
aturday  -1.59
Khan     -1.58
umbent   -1.57
sible    -1.53
ftware   -1.48
online   -1.41
ÏĮÏĦι    -1.40
haste    -1.39
Positive Logits
Token    Logit
stem     2.11
storm    1.99
wash     1.77
iac      1.71
parench  1.70
waves    1.65
wide     1.62
scape    1.61
sheet    1.58
washed   1.53
Activations Density: 0.051%