INDEX
Explanations
descriptions of physical structures and locations
Neuron Alignment
Index    Value    % of L₁
1385     +0.17    0.5%
752      +0.07    0.2%
946      +0.07    0.2%
Correlated Neurons
Index    Pearson Corr.    Cos Sim.
736      +0.17            0.04
1385     +0.07            0.03
946      +0.07            0.03
Negative Logits
ukunft      -0.66
<bos>       -0.65
tierrez     -0.63
inconce     -0.63
Nguy        -0.58
unspeak     -0.54
coiff       -0.54
redire      -0.54
unwarran    -0.53
zanas       -0.51
Positive Logits
écout         0.71
accompagne    0.62
vécu          0.59
choisis       0.58
empêche       0.57
répon         0.56
réunis        0.56
soigne        0.55
terminée      0.55
thick         0.54
Activations Density 0.220%