INDEX
Explanations
technical concepts or instructions
Neuron Alignment
Index   Value   % of L₁
50      +0.18   1.1%
2011    +0.13   0.7%
169     +0.11   0.7%
Correlated Neurons
Index   P. Corr.   Cos Sim.
169     +0.18      0.03
2011    +0.13      0.03
251     +0.11      0.02
Negative Logits
<bos>       -3.41
/***        -0.99
///**       -0.82
/*!         -0.81
Fordítás    -0.71
<tfoot>     -0.66
ⓧ           -0.65
})();       -0.65
//---       -0.65
<?          -0.64
Positive Logits
affor       1.21
increa      1.19
wien        1.19
thut        1.18
concept     1.17
Concept     1.15
scrat       1.15
fta         1.14
impra       1.12
mcdonald    1.11
Activations Density: 0.066%