INDEX
Explanations
company and technology-related terms and acronyms
Neuron Alignment
Index   Value   % of L₁
397     +0.20   1.1%
976     +0.14   0.8%
1323    +0.12   0.6%
Correlated Neurons
Index   P. Corr.   Cos Sim.
397     +0.20      0.06
1056    +0.14      0.05
1256    +0.12      0.04
Negative Logits
<bos>       -1.82
ⓧ           -0.72
intersper   -0.56
quitted     -0.53
rejoin      -0.52
harmed      -0.52
reconno     -0.51
magnify     -0.51
Schön       -0.51
flä         -0.51
Positive Logits
Sta         1.15
ST          1.11
Sta         1.00
Stu         0.96
st          0.94
sta         0.92
ST          0.92
Moderato    0.87
Hauteur     0.87
Staf        0.86
Activations Density 0.381%