INDEX
Explanations
references to the Internet and its associated technologies
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
156
+0.21
1.2%
451
+0.12
0.7%
460
+0.11
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
451
+0.21
0.01
460
+0.12
0.02
173
+0.11
0.01
Negative Logits
drastic
-1.88
uterus
-1.72
Į
-1.71
solitary
-1.66
unts
-1.64
tear
-1.58
same
-1.54
aucoup
-1.53
certain
-1.52
resign
-1.51
POSITIVE LOGITS
Explorer
1.99
borne
1.91
works
1.85
Access
1.79
Gate
1.73
Link
1.73
Resources
1.61
burst
1.59
sphere
1.57
Archive
1.55
Activations Density 0.090%