INDEX
Explanations
instances of punctuation and format indicators in text
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
399
+0.12
0.6%
462
+0.11
0.6%
283
+0.11
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
203
+0.12
0.35
23
+0.11
0.31
494
+0.11
0.24
Negative Logits
Explorer
-1.61
amethasone
-1.42
________________
-1.42
MSC
-1.42
aceae
-1.37
onial
-1.37
otech
-1.35
dale
-1.33
hores
-1.33
ayer
-1.33
POSITIVE LOGITS
ķ
4.39
ĸ´
4.09
ĸ
4.04
¿
3.92
£
3.89
ĥ½
3.86
Ī
3.82
¦
3.81
§
3.77
©
3.71
Activations Density 3.129%