INDEX
Explanations
numerical values indicating player statistics or performance metrics
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.24
0.9%
1978
+0.18
0.7%
1870
+0.11
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1978
+0.24
0.10
1105
+0.18
0.07
776
+0.11
0.08
Negative Logits
<bos>
-2.09
ⓧ
-0.91
ddelweddau
-0.70
<!--
-0.69
intende
-0.68
ždý
-0.64
endwhile
-0.64
***!
-0.63
Personendaten
-0.63
glMatrixMode
-0.63
POSITIVE LOGITS
maneu
1.18
impra
1.14
affor
0.99
suscep
0.95
excru
0.94
increa
0.91
inev
0.89
🤣🤣
0.89
erad
0.88
tolerably
0.86
Activations Density 0.296%