INDEX
Explanations
references to public education and related resources
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.38
1.4%
1013
+0.11
0.4%
509
+0.11
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1336
+0.38
0.07
1533
+0.11
0.03
527
+0.11
0.05
Negative Logits
<bos>
-1.61
contextes
-0.57
repress
-0.55
ⓧ
-0.55
disarm
-0.53
-0.52
dampen
-0.52
liker
-0.52
neutralize
-0.51
lubric
-0.51
POSITIVE LOGITS
Jambi
1.17
Minang
1.15
Karang
1.12
Banjar
1.12
Pekan
1.09
Lampung
1.08
Tanjung
1.07
Muhamma
1.06
Palembang
1.05
silikon
1.05
Activations Density 1.678%