INDEX
Explanations
information related to official names and designations
Neuron Alignment
Index   Value   % of L₁
1194    +0.14   0.8%
30      +0.14   0.8%
316     +0.11   0.6%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1363    +0.14      0.07
1194    +0.14      0.07
227     +0.11      0.09
Negative Logits
<bos>       -2.20
(blank)     -0.86
/**         -0.85
<?          -0.84
ⓧ           -0.78
<?          -0.66
/*          -0.64
apply       -0.56
нился       -0.53
jsdelivr    -0.52
Positive Logits
napoli      1.04
santiago    1.01
sovere      0.96
ricardo     0.91
sergio      0.89
fernando    0.89
eduardo     0.88
valencia    0.88
toledo      0.88
bandung     0.88
Activations Density: 0.859%