INDEX
Explanations
references to payment and financial transactions
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
263
+0.21
1.3%
153
+0.19
1.1%
17
+0.15
0.9%
Correlated Neurons
Index
P. Corr.
Cos Sim.
10
+0.21
0.09
263
+0.19
0.14
153
+0.15
0.06
Negative Logits
aged
-1.60
\].
-1.51
ophone
-1.43
cases
-1.41
sciously
-1.36
case
-1.34
harmonic
-1.34
alike
-1.34
wenn
-1.32
\],
-1.31
POSITIVE LOGITS
¡
3.38
ĨĴ
3.33
Īĺ
3.23
ĸ´
3.19
²
3.19
Ļ
3.18
Ķ
3.13
¬
3.13
ĺ
3.12
3.10
Activations Density 4.070%