INDEX
Explanations
HTML and CSS code related to styling and web design properties
Neuron Alignment
Index    Value    % of L₁
453      +0.16    0.5%
876      +0.14    0.4%
1343     +0.13    0.4%
Correlated Neurons
Index    P. Corr.    Cos Sim.
453      +0.16       0.04
876      +0.14       -0.00
2008     +0.13       0.03
Negative Logits
Token           Logit
<bos>           -1.12
vainly          -0.57
about           -0.57
former          -0.56
endeavouring    -0.55
unjustly        -0.55
obstinate       -0.53
intrigu         -0.53
and             -0.53
vexed           -0.53
Positive Logits
Token         Logit
soggior       1.14
ristor        1.13
affez         1.10
sappi         1.08
exé           1.03
tranquillo    0.98
scuro         0.97
germain       0.97
cioc          0.96
camicia       0.96
Activations Density: 0.167%