INDEX
Explanations
references to digital media platforms and related interactions
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
369
+0.15
0.9%
302
+0.15
0.9%
410
+0.13
0.8%
Correlated Neurons
Index
P. Corr.
Cos Sim.
57
+0.15
0.11
77
+0.15
0.01
410
+0.13
0.16
Negative Logits
"}](#
-1.98
âĢķ
-1.64
,'"
-1.62
bras
-1.61
,''
-1.61
.[@
-1.60
zzles
-1.55
,[@
-1.48
oretic
-1.45
apers
-1.43
POSITIVE LOGITS
ités
1.40
colleague
1.38
ities
1.38
ité
1.38
actin
1.35
oath
1.33
ciliation
1.32
Guide
1.32
colleagues
1.31
Representative
1.28
Activations Density 5.532%