INDEX
Explanations
ways and methods of doing things
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1983
+0.14
0.4%
1705
+0.10
0.3%
971
+0.10
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1983
+0.14
0.05
36
+0.10
0.05
1519
+0.10
0.04
Negative Logits
kuku
-0.72
obligator
-0.72
akut
-0.67
mits
-0.67
„,
-0.67
Minang
-0.66
ert
-0.66
Sén
-0.66
domina
-0.65
Lma
-0.65
POSITIVE LOGITS
way
0.99
way
0.93
Way
0.92
ways
0.90
Way
0.89
WAY
0.89
Ways
0.83
ways
0.83
WAYS
0.80
WAY
0.78
Activations Density 0.091%