INDEX
Explanations
conditional phrases indicating observation or explanation
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1510
+0.09
0.2%
1343
+0.07
0.2%
2000
+0.07
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1801
+0.09
0.01
1136
+0.07
0.02
1551
+0.07
0.02
Negative Logits
Augu
-1.02
EEU
-0.94
Juf
-0.94
unve
-0.93
Simult
-0.93
unlaw
-0.91
Eft
-0.88
increa
-0.88
Intere
-0.88
depic
-0.87
POSITIVE LOGITS
CURLOPT
0.60
InjectAttribute
0.55
understand
0.50
săptăm
0.50
ɵɵ
0.49
PDOException
0.49
understand
0.48
getSystemService
0.48
ContentLoaded
0.48
CURLOPT
0.48
Activations Density 0.103%