INDEX
Explanations
details related to observing and analyzing data, particularly in educational settings
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.17
1.0%
1323
+0.12
0.7%
662
+0.10
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1323
+0.17
0.02
662
+0.12
0.02
136
+0.10
0.01
Negative Logits
<bos>
-3.20
ⓧ
-0.80
-0.76
/***
-0.74
<?
-0.73
/*++
-0.67
/**
-0.66
Даль
-0.63
Vegeu
-0.60
protected
-0.58
POSITIVE LOGITS
stockholm
1.67
wien
1.56
strick
1.54
maneu
1.53
secon
1.52
increa
1.51
affor
1.50
effe
1.48
fta
1.48
tew
1.47
Activations Density 0.070%