INDEX
Explanations
code snippets related to software frameworks and configurations
Neuron Alignment
Index   Value   % of L₁
1097    +0.10   0.3%
1407    +0.10   0.3%
499     +0.09   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
876     +0.10      0.00
1871    +0.10      0.03
453     +0.09      0.03
Negative Logits
ujedno      -0.71
MediatR     -0.67
mistak      -0.66
cytoplas    -0.59
McLaugh     -0.58
Stateful    -0.54
plak        -0.54
thinkable   -0.53
relenting   -0.53
Rumania     -0.53
Positive Logits
obiet        0.67
JS           0.66
dimenti      0.63
JS           0.61
js           0.60
JavaScript   0.59
Javascript   0.57
pantal       0.57
😭😭         0.56
vogli        0.56
Activations Density 0.164%