INDEX
Explanations
occurrences of the number 24
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
369
+0.17
1.0%
458
+0.15
0.9%
477
+0.13
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
458
+0.17
0.04
445
+0.15
0.04
498
+0.13
0.04
Negative Logits
rition
-1.65
Wars
-1.63
possession
-1.62
World
-1.60
Circuit
-1.54
sale
-1.52
responsibility
-1.50
world
-1.46
parap
-1.44
title
-1.41
POSITIVE LOGITS
³
3.41
ĥ½
3.24
ª
3.23
µ
3.11
®
3.08
·¸
3.06
Ń
2.88
ħ
2.85
¬
2.85
¾
2.85
Activations Density 0.032%