INDEX
Explanations
occurrences of the word "manual" in various contexts
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.27
1.4%
1092
+0.13
0.7%
1233
+0.13
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1233
+0.27
0.02
1044
+0.13
0.02
1092
+0.13
0.02
Negative Logits
<bos>
-2.18
enumerate
-0.55
transform
-0.55
share
-0.54
宿
-0.54
transform
-0.53
Smith
-0.52
collaborate
-0.52
/*++
-0.52
<?
-0.51
POSITIVE LOGITS
MANUAL
1.16
Minang
1.15
Manuals
1.13
Manual
1.13
manual
1.10
Manual
1.07
Compañ
1.06
ecru
1.05
manuals
1.04
Augu
1.00
Activations Density 0.059%