INDEX
Explanations
terms related to ownership
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.27
1.6%
544
+0.16
1.0%
1810
+0.14
0.8%
Correlated Neurons
Index
P. Corr.
Cos Sim.
544
+0.27
0.03
1810
+0.16
0.03
1034
+0.14
0.02
Negative Logits
<bos>
-2.68
/***
-0.68
ⓧ
-0.65
<?
-0.60
-0.58
twimg
-0.58
context
-0.56
ProtoMessage
-0.56
//---
-0.55
HasColumnType
-0.55
POSITIVE LOGITS
Minang
1.42
bandung
1.31
riva
1.26
riviera
1.16
Muhamma
1.14
Owner
1.12
saar
1.12
jawa
1.11
Batam
1.10
Owners
1.06
Activations Density 0.029%