INDEX
Explanations
sentences related to technology and system operations
Neuron Alignment
Index    Value    % of L₁
2034     +0.16    0.5%
2019     +0.08    0.2%
270      +0.08    0.2%
Correlated Neurons
Index    P. Corr.    Cos Sim.
1959     +0.16       0.06
963      +0.08       0.04
1021     +0.08       0.04
Negative Logits
pernic      -0.67
חיצוניים    -0.64
conclud     -0.63
ideolog     -0.62
notor       -0.60
experim     -0.59
sentito     -0.59
legendar    -0.58
Darío       -0.57
pædia       -0.57
Positive Logits
It          0.64
Because     0.63
Allows      0.63
it          0.60
because     0.58
allows      0.56
They        0.56
Gives       0.55
Provides    0.55
it          0.54
Activations Density: 0.469%