INDEX
Explanations
No Explanations Found
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
427
+0.17
1.0%
54
+0.13
0.7%
98
+0.11
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
0
-0.17
0.00
1
-0.13
0.00
2
-0.11
0.00
Negative Logits
TypeDef
-1.75
ophila
-1.66
alg
-1.63
StackTrace
-1.61
och
-1.57
00001
-1.48
elen
-1.47
Classes
-1.45
ETHOD
-1.45
oir
-1.43
POSITIVE LOGITS
ħ
1.79
directed
1.73
Ģ
1.71
1.63
↵
1.63
↵
1.63
1.63
↵↵
1.63
<|outofrange|>
1.63
↵
1.63
Activations Density 0.000%
No Known Activations
This feature has no known activations.