INDEX
Explanations
specific numbers or numerical patterns
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.16
1.0%
1978
+0.13
0.8%
667
+0.12
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1978
+0.16
0.10
776
+0.13
0.09
321
+0.12
0.07
Negative Logits
<bos>
-3.01
ⓧ
-0.78
GEBURTSDATUM
-0.77
springfox
-0.74
abestanden
-0.74
agrí
-0.73
rawDesc
-0.73
AddTagHelper
-0.72
wireType
-0.70
ivelany
-0.70
POSITIVE LOGITS
maneu
1.64
affor
1.61
impra
1.60
reluct
1.57
unlaw
1.55
increa
1.55
inev
1.48
guarante
1.44
Augu
1.44
strick
1.43
Activations Density 0.275%