INDEX
Explanations
specific data structure and coding elements
Neuron Alignment
Index   Value   % of L₁
468     +0.16   0.5%
1510    +0.13   0.4%
1699    +0.12   0.4%
Correlated Neurons
Index   P. Corr.   Cos Sim.
468     +0.16      0.04
1510    +0.13      0.03
1317    +0.12      0.04
Negative Logits
Hvor       -0.80
Hvem       -0.78
Jornal     -0.78
Sitio      -0.77
Ekster     -0.75
Flere      -0.75
Specifik   -0.75
Več        -0.73
Podob      -0.73
Hvad       -0.73
Positive Logits
fto    1.70
!...   1.65
effe   1.62
„,     1.57
blos   1.53
squa   1.52
sii    1.52
fte    1.52
fta    1.51
?...   1.50
Activations Density 0.234%