INDEX
Explanations
the name "Ryan" in the text
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.13
0.9%
1101
+0.12
0.8%
1921
+0.10
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1101
+0.13
0.03
1097
+0.12
0.03
1921
+0.10
0.03
Negative Logits
<bos>
-2.91
ⓧ
-1.07
-1.02
<?
-0.99
/**
-0.94
/***
-0.87
<?
-0.79
/*
-0.75
///**
-0.66
AutoScaleMode
-0.65
POSITIVE LOGITS
Ryan
1.29
Ryan
1.25
ryan
1.14
cæ
0.82
Juillet
0.81
pank
0.80
saar
0.79
ryan
0.79
silikon
0.79
Bibl
0.79
Activations Density 0.073%