INDEX
Explanations
No Explanations Found
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
113
+0.13
0.7%
461
+0.12
0.7%
273
+0.12
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
69
+0.13
0.38
151
+0.12
0.31
434
+0.12
0.30
Negative Logits
herself
-1.68
Stats
-1.47
routine
-1.42
foreseeable
-1.38
sie
-1.38
Associates
-1.37
Mae
-1.35
Siem
-1.34
entangled
-1.33
Manning
-1.33
POSITIVE LOGITS
»¿
2.09
ı
2.04
±
2.03
¶
2.00
ľĵ
1.96
¢
1.96
Į
1.90
ij
1.88
³
1.87
Ļª
1.83
Activations Density 1.625%
No Known Activations
This feature has no known activations.