INDEX
Explanations
mentions of technology details and hardware specifications
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
227
+0.12
0.3%
678
+0.11
0.3%
604
+0.10
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
678
+0.12
0.05
1446
+0.11
0.02
181
+0.10
0.03
Negative Logits
<bos>
-0.82
UnitTesting
-0.79
pamph
-0.70
IsContent
-0.67
InjectAttribute
-0.63
EndContext
-0.63
OGND
-0.62
StoryboardSegue
-0.62
يتيمه
-0.60
AnchorStyles
-0.59
POSITIVE LOGITS
[''
0.85
considér
0.72
affez
0.65
cerchi
0.64
éprou
0.64
trouva
0.64
surpl
0.63
lusso
0.63
obé
0.63
merav
0.62
Activations Density 0.279%