INDEX
Explanations
information related to weather forecasts and sports game schedules
Neuron Alignment
Index    Value    % of L₁
411      +0.17    0.6%
228      +0.13    0.5%
1306     +0.12    0.5%
Correlated Neurons
Index    P. Corr.    Cos Sim.
411      +0.17       0.03
172      +0.13       0.02
1036     +0.12       0.01
Negative Logits
that       -0.54
just       -0.52
between    -0.51
وذلك       -0.51
іде        -0.51
.          -0.51
through    -0.51
however    -0.50
バリー     -0.50
to         -0.49
Positive Logits
shenan      1.14
reluct      1.13
maneu       1.08
emphat      1.07
Chapitre    1.04
Cfr         1.03
increa      1.02
uninten     1.02
milf        1.01
franz       1.01
Activations Density 0.070%