Explanations
concepts related to relationships and discussions on fidelity and commitment
Neuron Alignment
Index    Value    % of L₁
544      +0.13    0.5%
629      +0.12    0.4%
1870     +0.11    0.4%
Correlated Neurons
Index    P. Corr.    Cos Sim.
544      +0.13       0.03
629      +0.12       0.03
1105     +0.11       0.03
Negative Logits
robus      -0.81
solidar    -0.78
Áng        -0.77
revan      -0.74
viciss     -0.74
diffusi    -0.73
invari     -0.70
aton       -0.69
pessi      -0.69
javier     -0.68
Positive Logits
relationship     1.28
relationships    1.22
relationship     1.20
Relationship     1.19
relationships    1.09
Relationships    1.08
Relationship     1.06
Relationships    0.98
RELATIONSHIP     0.90
relations        0.82
Activations Density 0.060%