INDEX
Explanations
explanations or reasons for phenomena
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
764
+0.08
0.2%
752
+0.08
0.2%
60
+0.07
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1209
+0.08
0.05
60
+0.08
0.05
1521
+0.07
0.03
Negative Logits
ghijklmnop
-0.61
astify
-0.59
InputTagHelper
-0.57
ioutil
-0.57
ExecuteAsync
-0.54
wikk
-0.53
FetchType
-0.53
ThroughAttribute
-0.53
Geplaatst
-0.52
WireFormat
-0.52
POSITIVE LOGITS
encomp
0.86
intersper
0.85
increa
0.80
tucson
0.77
toledo
0.74
orlando
0.71
amsterdam
0.70
bangkok
0.70
shenan
0.69
cuck
0.69
Activations Density 0.346%