INDEX
Explanations
structured JSON elements within a document
Neuron Alignment
Index    Value    % of L₁
453      +0.15    0.5%
876      +0.11    0.3%
1343     +0.10    0.3%
Correlated Neurons
Index    P. Corr.    Cos Sim.
453      +0.15       0.03
615      +0.11       0.01
1780     +0.10       0.02
Negative Logits
McLaugh     -0.96
ujedno      -0.91
kompres     -0.82
kriminal    -0.79
silikon     -0.79
mikrofon    -0.79
unlaw       -0.78
Vaugh       -0.76
konflik     -0.75
Punj        -0.75
Positive Logits
bonjour     0.85
jaja        0.81
nè          0.77
appétit     0.76
desideri    0.74
?</         0.74
ã           0.73
frambo      0.73
?«          0.72
diable      0.71
Activations Density: 0.076%