Explanations
numbers and specific quantifiers
Neuron Alignment
Index   Value   % of L₁
50      +0.20   0.9%
568     +0.11   0.5%
994     +0.11   0.5%
Correlated Neurons
Index   P. Corr.   Cos Sim.
994     +0.20      0.06
776     +0.11      0.12
1526    +0.11      0.08
Negative Logits
Token         Logit
<bos>         -3.04
ⓧ             -0.94
/***          -0.88
//});         -0.84
//{           -0.78
//};          -0.74
})();         -0.70
gynnwys       -0.68
Autoritní     -0.67
Демографія    -0.66
Positive Logits
Token        Logit
ecru         1.60
tramont      1.59
napoli       1.59
milano       1.56
sappi        1.54
swarovski    1.54
valencia     1.52
bordeaux     1.50
viciss       1.50
bandung      1.46
Activations Density: 2.182%