INDEX
Explanations
mathematical or logical symbols and their representations
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.18
1.0%
161
+0.17
1.0%
341
+0.17
0.9%
Correlated Neurons
Index
P. Corr.
Cos Sim.
341
+0.18
0.02
892
+0.17
0.03
1964
+0.17
0.02
Negative Logits
<bos>
-2.59
-0.87
ⓧ
-0.84
contentLoaded
-0.79
<?
-0.77
/**
-0.74
Autoritní
-0.64
Více
-0.64
/*
-0.62
@[+][
-0.61
POSITIVE LOGITS
bandung
1.15
maroc
1.12
">...
1.03
lapin
0.99
milano
0.98
eiffel
0.98
babi
0.98
gmbh
0.98
swarovski
0.97
jawa
0.97
Activations Density 0.067%