INDEX
Explanations
quotations and words with special characters (such as quotation marks and slashes)
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
2019
+0.16
0.5%
736
+0.12
0.3%
50
+0.09
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
736
+0.16
0.04
545
+0.12
0.03
1541
+0.09
0.03
Negative Logits
impra
-1.78
snoopy
-1.75
reluct
-1.74
strick
-1.72
increa
-1.72
depic
-1.71
scrat
-1.70
secon
-1.68
shenan
-1.66
disagre
-1.65
POSITIVE LOGITS
BoxShadow
0.81
MouseAdapter
0.80
سكانية
0.71
astify
0.70
Trotz
0.70
ideolog
0.70
bibfnamefont
0.68
nueces
0.66
nasel
0.66
TextInputLayout
0.65
Activations Density 0.167%