INDEX
Explanations
statements related to accomplishments or achievements
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1746
+0.08
0.2%
872
+0.07
0.2%
113
+0.07
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1746
+0.08
0.03
513
+0.07
0.02
270
+0.07
0.04
Negative Logits
Nö
-0.75
gesta
-0.71
logis
-0.71
kapag
-0.67
kram
-0.65
Anm
-0.64
kasama
-0.64
makro
-0.63
makita
-0.62
Keny
-0.61
POSITIVE LOGITS
such
0.85
such
0.82
like
0.80
SUCH
0.77
LIKE
0.64
like
0.64
Such
0.63
solch
0.61
zumal
0.60
zoals
0.60
Activations Density 0.338%