INDEX
Explanations
expressions related to evaluation or concern
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
674
+0.14
0.4%
168
+0.12
0.4%
437
+0.11
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
437
+0.14
0.02
168
+0.12
0.02
554
+0.11
0.02
Negative Logits
pylab
-0.76
brooklyn
-0.71
ecru
-0.69
skimage
-0.69
psycopg
-0.68
Necess
-0.68
pymysql
-0.67
Mga
-0.62
heapq
-0.61
orlando
-0.59
POSITIVE LOGITS
utop
0.66
AsUp
0.61
thuy
0.60
lapto
0.59
awtextra
0.58
tuong
0.56
vort
0.54
fars
0.53
Autoritní
0.52
bals
0.52
Activations Density 0.063%