INDEX
Explanations
legal terms and conditions related to software usage and licensing
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
205
+0.14
0.8%
2
+0.13
0.7%
71
+0.12
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
371
+0.14
0.02
205
+0.13
0.01
268
+0.12
0.01
Negative Logits
Īĺ
-3.23
ľĵ
-3.21
ĨĴ
-3.19
ĸ
-3.13
¦
-3.11
ı
-3.10
®
-3.07
¡
-3.05
¿½
-3.02
İ
-3.00
POSITIVE LOGITS
imposed
1.75
breach
1.55
↵
1.51
overr
1.51
Âĵ
1.49
implied
1.49
condition
1.46
invalid
1.42
conditions
1.42
effect
1.41
Activations Density 0.048%