INDEX
Explanations
phrases related to technology and data analysis
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1870
+0.13
0.4%
1385
+0.13
0.4%
577
+0.12
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
914
+0.13
0.03
455
+0.13
0.02
198
+0.12
0.03
Negative Logits
trovar
-0.76
esser
-0.74
michelin
-0.70
exorbit
-0.70
accla
-0.69
fortn
-0.66
encomp
-0.66
imparare
-0.65
disgra
-0.65
arric
-0.65
POSITIVE LOGITS
kony
0.59
fusca
0.57
IntoConstraints
0.56
iveau
0.56
urance
0.55
cso
0.54
high
0.54
FBref
0.52
High
0.52
lele
0.51
Activations Density 0.185%