Explanations
mentions of tollbooths
Neuron Alignment
Index   Value   % of L₁
410     +0.12   0.4%
1350    +0.10   0.3%
241     +0.09   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
410     +0.12      0.02
1495    +0.10      0.02
1778    +0.09      0.02
Negative Logits
<tr>           -0.74
/**            -0.73
立             -0.73
namespace      -0.72
//             -0.72
*/             -0.72
tie            -0.70
/*             -0.70
WriteLiteral   -0.70
max            -0.69
Positive Logits
toll        3.03
Toll        2.86
Toll        2.62
tolls       2.46
toll        2.43
strick      2.11
affor       2.10
stockholm   2.07
maneu       2.03
increa      2.03
Activations Density: 0.191%