INDEX
Explanations
punctuation marks, specifically commas
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.23
1.0%
382
+0.14
0.6%
1331
+0.11
0.5%
Correlated Neurons
Index
P. Corr.
Cos Sim.
484
+0.23
0.04
1331
+0.14
0.04
229
+0.11
0.04
Negative Logits
<bos>
-2.57
ⓧ
-0.93
},[])
-0.72
inaugurate
-0.70
intersper
-0.68
-0.66
ratify
-0.64
<?
-0.64
endow
-0.63
supersede
-0.63
POSITIVE LOGITS
bandung
0.97
multicolore
0.95
tibi
0.92
cioc
0.92
Luglio
0.91
cæ
0.90
Giugno
0.90
sirop
0.89
Portugu
0.88
Gestão
0.87
Activations Density 0.056%