INDEX
Explanations
references to everyday life activities or products
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1837
+0.14
0.6%
25
+0.14
0.6%
50
+0.12
0.5%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1793
+0.14
0.03
1837
+0.14
0.03
841
+0.12
0.02
Negative Logits
<bos>
-1.94
/***
-0.65
HasForeignKey
-0.64
lateinit
-0.59
CodeDom
-0.54
('=-0.54
adopt
-0.53
GenerationType
-0.53
reunite
-0.52
Ciò
-0.51
POSITIVE LOGITS
everyday
1.00
prado
0.94
milano
0.94
Everyday
0.93
pican
0.89
Everyday
0.89
monaster
0.88
Ordinary
0.88
ordinary
0.87
tramont
0.86
Activations Density 0.285%