INDEX
Explanations
phrases related to warnings or advice
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
605
+0.08
0.2%
198
+0.08
0.2%
321
+0.08
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1851
+0.08
0.04
912
+0.08
0.04
100
+0.08
0.04
Negative Logits
parlamento
-0.57
setLayoutParams
-0.53
PageFactory
-0.52
Siria
-0.51
emmel
-0.51
lccccc
-0.50
Rumania
-0.50
blurRadius
-0.50
Mâ
-0.49
JpaRepository
-0.49
POSITIVE LOGITS
idać
0.66
bandai
0.60
leçon
0.57
pylab
0.55
wako
0.54
asteroide
0.52
underestimate
0.52
créativité
0.52
akong
0.50
épreuve
0.50
Activations Density 0.298%