INDEX
Explanations
keywords related to orders and processing
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
113
+0.15
0.9%
376
+0.12
0.7%
95
+0.11
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
113
+0.15
0.02
460
+0.12
0.02
95
+0.11
0.02
Negative Logits
---|---
-1.65
thens
-1.56
semin
-1.56
rapy
-1.56
))$.
-1.55
---|---|---
-1.55
|$.
-1.54
ffield
-1.54
ktiv
-1.51
$.[]{-1.49
POSITIVE LOGITS
heet
2.28
ģ
1.74
mant
1.71
ingly
1.69
amente
1.69
hold
1.66
etable
1.59
¢
1.57
¾
1.53
orders
1.47
Activations Density 0.014%