INDEX
Explanations
connections and relationships between different elements or entities
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1533
+0.11
0.3%
814
+0.09
0.3%
1257
+0.08
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
814
+0.11
0.02
1257
+0.09
0.03
961
+0.08
0.03
Negative Logits
konserv
-0.69
kosme
-0.68
praktik
-0.66
Souha
-0.62
Belén
-0.61
notor
-0.59
Darío
-0.59
Mónica
-0.59
Haci
-0.58
afront
-0.58
POSITIVE LOGITS
connections
0.95
linkages
0.82
linking
0.81
connecting
0.80
interconnected
0.79
interconnection
0.78
connections
0.78
Connections
0.75
connectors
0.74
links
0.74
Activations Density 0.606%