INDEX
Explanations
companies, names, and details related to specific organizations or events
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
479
+0.16
1.0%
50
+0.13
0.8%
596
+0.12
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
479
+0.16
0.03
1137
+0.13
0.03
1335
+0.12
0.03
Negative Logits
<bos>
-2.77
-0.87
ⓧ
-0.83
SystemColors
-0.74
DataAnnotations
-0.71
<?
-0.69
ൊ
-0.69
Italijani
-0.67
font
-0.67
AllowUser
-0.64
POSITIVE LOGITS
maneu
1.81
inev
1.61
increa
1.55
milano
1.54
impra
1.53
embra
1.52
depic
1.52
excru
1.51
indestru
1.50
emphat
1.49
Activations Density 0.184%