Explanations
references to software applications or utilities
Neuron Alignment
Index   Value   % of L₁
228     +0.14   0.7%
406     +0.13   0.7%
1296    +0.13   0.7%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1343    +0.14      0.09
227     +0.13      0.09
1097    +0.13      0.08
Negative Logits
<bos>    -1.78
vēl      -0.89
iesp     -0.84
ķ        -0.80
īpa      -0.79
šķ       -0.79
daudz    -0.76
vairāk   -0.75
bēr      -0.72
ļ        -0.72
Positive Logits
affor     1.59
increa    1.56
impra     1.55
milf      1.54
shenan    1.49
coar      1.49
madonna   1.47
depic     1.47
excru     1.44
lidl      1.44
Activations Density 0.939%