INDEX
Explanations
financial terms and terms related to learning or education
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
674
+0.32
1.2%
1842
+0.24
0.9%
184
+0.20
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1842
+0.32
0.05
184
+0.24
0.02
1220
+0.20
0.03
Negative Logits
maneu
-0.64
vry
-0.54
resear
-0.52
Oester
-0.51
Heere
-0.51
Karls
-0.50
fortn
-0.50
Öster
-0.49
Leip
-0.48
unve
-0.48
POSITIVE LOGITS
susun
0.66
referrerpolicy
0.65
<bos>
0.64
silang
0.62
pagkak
0.57
OGND
0.57
and
0.56
siyang
0.55
itong
0.54
và
0.53
Activations Density 0.216%