INDEX
Explanations
phrases related to statistical analysis and data interpretation
Neuron Alignment
Index   Value   % of L₁
752     +0.21   0.7%
1967    +0.10   0.3%
897     +0.10   0.3%
Correlated Neurons
Index   Pearson Corr.   Cos Sim.
752     +0.21           0.04
1837    +0.10           0.04
732     +0.10           0.03
Negative Logits
adecimal    -0.60
stuks       -0.54
ToUse       -0.52
ToRemove    -0.52
relenting   -0.52
<bos>       -0.50
ForUser     -0.50
uwag        -0.50
folgendes   -0.49
Completo    -0.49
Positive Logits
patin       0.91
alpes       0.87
isoli       0.82
notor       0.82
tomat       0.81
solidar     0.81
dè          0.81
fono        0.80
gubern      0.80
raste       0.79
Activation Density: 0.161%