INDEX
Explanations
occurrences of the command "fi."
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
376
+0.13
0.8%
82
+0.12
0.7%
260
+0.12
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
82
+0.13
0.02
37
+0.12
0.01
260
+0.12
0.02
Negative Logits
»¿
-1.78
Īĺ
-1.68
ĨĴ
-1.66
tolerated
-1.66
©
-1.64
SO
-1.60
NO
-1.55
HAS
-1.43
iply
-1.39
user
-1.38
POSITIVE LOGITS
ÅĽci
1.96
ery
1.91
”:
1.81
”—
1.79
âĢIJ
1.78
xtures
1.68
enda
1.66
endas
1.65
establ
1.63
endo
1.62
Activations Density 0.014%