INDEX
Explanations
information related to data analysis and decision-making
Neuron Alignment
Index   Value   % of L₁
604     +0.12   0.4%
2019    +0.10   0.3%
1373    +0.08   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1415    +0.12      0.05
707     +0.10      0.04
120     +0.08      0.04
Negative Logits
intersper   -1.69
maneu       -1.63
strick      -1.58
milf        -1.57
depic       -1.56
?...        -1.56
!...        -1.56
snoopy      -1.51
encomp      -1.50
shenan      -1.50
Positive Logits
help         0.88
enhance      0.86
enable       0.85
increase     0.80
create       0.79
give         0.77
contribute   0.77
provide      0.76
bring        0.76
prevent      0.76
Activations Density: 0.417%