INDEX
Explanations
references to electronic devices, especially laptops
Neuron Alignment
Index   Value   % of L₁
1637    +0.14   0.8%
1103    +0.14   0.7%
795     +0.13   0.7%
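One way to read the "% of L₁" column is as each neuron's share of the L1 norm of the vector being inspected. The sketch below assumes that reading (absolute value divided by the sum of absolute values); the function name `l1_share` and the toy vector are hypothetical and are not taken from the dashboard.

```python
def l1_share(weights):
    """Return each entry's share of the vector's L1 norm, in percent."""
    l1 = sum(abs(w) for w in weights)
    if l1 == 0:
        raise ValueError("L1 norm is zero; shares are undefined")
    return [100.0 * abs(w) / l1 for w in weights]


# Toy vector, made up for illustration only.
toy = [0.5, -0.25, 0.25]
shares = l1_share(toy)
```

Under this reading, a value of +0.14 with a share of roughly 0.7% to 0.8% would imply an L1 norm somewhere around 18 to 20, which is at least consistent across the three rows shown.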
Correlated Neurons
Index   P. Corr.   Cos Sim.
795     +0.14      0.02
1385    +0.14      0.03
1637    +0.13      0.02
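The two columns measure different things, which is why a neuron can have a Pearson correlation of +0.14 but a cosine similarity near zero. A minimal sketch of both quantities on plain Python lists follows; which vectors the dashboard actually compares is not stated here, so the inputs below are hypothetical toy data.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def pearson_correlation(a, b):
    """Pearson correlation: cosine similarity after mean-centering."""
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    return cosine_similarity([x - mean_a for x in a],
                             [y - mean_b for y in b])


# Toy data, made up for illustration only.
a = [1.0, 2.0, 3.0]
b = [3.0, 2.0, 1.0]
cos = cosine_similarity(a, b)
corr = pearson_correlation(a, b)
```

On this toy pair the cosine similarity is positive while the Pearson correlation is negative, because centering removes the shared positive offset.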
Negative Logits
<bos>        -1.89
encomp       -1.07
/**          -1.02
cushi        -1.01
intersper    -0.98
hairc        -0.97
affor        -0.93
<?           -0.93
unspeak      -0.93
milf         -0.93
Positive Logits
laptop     1.19
Lap        1.09
lap        1.06
Lap        1.06
lap        1.04
Laptop     1.03
laptops    1.03
Laptop     1.00
laptop     0.98
LAP        0.90
Activations Density: 0.237%
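Activation density is commonly taken to be the fraction of sampled tokens on which the feature fires at all. The sketch below assumes that definition, with "fires" meaning an activation strictly above a threshold of zero; the function name `activation_density` and the sample values are hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of entries whose activation exceeds the threshold."""
    if not activations:
        raise ValueError("need at least one activation")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Toy activations, made up for illustration only.
sample = [0.0, 0.0, 0.0, 1.5]
density = activation_density(sample)
```

Under this definition, a density of 0.237% would mean the feature is active on roughly 2 to 3 of every 1,000 sampled tokens.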