INDEX
Explanations
terms related to cognitive abilities and tasks
Neuron Alignment
Index  Value  % of L₁
1870   +0.18  0.7%
687    +0.18  0.6%
1983   +0.16  0.6%
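The "% of L₁" column is not defined on the page. A plausible reading, stated here as an assumption, is that it gives each neuron's share of the L₁ norm of the feature's direction vector, that is, |w_i| divided by the sum of |w_j| over all neurons. The function name `l1_share` and the toy weights are hypothetical and only illustrate that reading:

```python
def l1_share(weights, index):
    """Fraction of the vector's L1 norm contributed by one entry.

    Assumption: "% of L1" means |w[index]| / sum(|w_j|). The page itself
    does not state the definition.
    """
    l1_norm = sum(abs(w) for w in weights)
    if l1_norm == 0:
        raise ValueError("L1 norm is zero; share is undefined")
    return abs(weights[index]) / l1_norm


# Toy example: the second entry holds 1.5 of a total L1 norm of 4.0.
share = l1_share([0.5, -1.5, 2.0], 1)
print(f"{share:.1%}")
```

Under this reading, a neuron with a small share, such as the values under 1% above, means the feature is spread across many neurons rather than aligned with a single one.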
Correlated Neurons
Index  P. Corr.  Cos Sim.
687    +0.18     0.02
1983   +0.18     0.02
1065   +0.16     0.02
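"P. Corr." and "Cos Sim." presumably stand for Pearson correlation and cosine similarity. What exactly is being compared is an assumption here: the sketch treats the correlation as taken over paired activation values (feature and neuron on the same tokens) and the cosine similarity as taken between two weight vectors. The helper names and inputs are illustrative, not the dashboard's own code:

```python
import math


def pearson(xs, ys):
    """Pearson correlation between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def cosine_similarity(u, v):
    """Cosine of the angle between two vectors of equal length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


# Hypothetical activations of a feature and a neuron on the same five tokens.
feature_acts = [0.0, 0.2, 0.0, 1.1, 0.4]
neuron_acts = [0.1, 0.3, 0.0, 0.9, 0.2]
print(pearson(feature_acts, neuron_acts))
print(cosine_similarity([1.0, 0.0], [1.0, 1.0]))
```

The two measures answer different questions: correlation compares behaviour over inputs after subtracting the means, while cosine similarity compares directions in weight space, so a neuron can rank highly on one and score near zero on the other, as in the table above.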
Negative Logits
Darío      -0.55
philanth   -0.54
philo      -0.54
Valentín   -0.52
incarcer   -0.52
Cormack    -0.51
dramatist  -0.50
ritratto   -0.50
Alcalde    -0.50
pamph      -0.49
Positive Logits
cognitive  1.14
Cognitive  1.08
Cognitive  1.05
cognitive  0.99
cognition  0.82
nitive     0.81
Cog        0.75
cogni      0.73
Cog        0.72
kog        0.70
Activations Density 0.084%
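Activation density is commonly read as the fraction of sampled tokens on which the feature fires. The sketch below assumes that reading and a firing threshold of zero; neither is confirmed by the page, and `activation_density` is a hypothetical helper:

```python
def activation_density(activations, threshold=0.0):
    """Fraction of tokens whose activation exceeds `threshold`.

    Assumption: a token counts as "active" when its activation is
    strictly greater than the threshold (zero by default).
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Toy sample: 84 active tokens out of 100,000.
sample = [1.0] * 84 + [0.0] * (100_000 - 84)
print(f"{activation_density(sample):.3%}")
```

A density this low would mean the feature fires on fewer than one token in a thousand, which fits a feature tied to a narrow family of tokens.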