INDEX
Explanations
words related to electrical or mechanical properties and processes
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
156
+0.12
0.7%
141
+0.12
0.6%
376
+0.12
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
111
+0.12
0.03
156
+0.12
0.02
271
+0.12
0.01
Negative Logits
Ķ
-1.80
ĥ
-1.73
mares
-1.60
pite
-1.59
yla
-1.58
liers
-1.58
¼
-1.56
decision
-1.56
lier
-1.50
·¸
-1.48
POSITIVE LOGITS
operated
1.68
PROVIDED
1.64
:#
1.63
connected
1.56
connected
1.54
charged
1.50
electric
1.50
wired
1.46
noreply
1.45
aeda
1.42
Activations Density 0.071%