INDEX
Explanations
keywords related to computer programming and technology
Neuron Alignment
Index   Value   % of L₁
573     +0.07   0.2%
1651    +0.07   0.2%
394     +0.07   0.2%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1937    +0.07      0.05
1648    +0.07      0.02
1446    +0.07      0.03
Negative Logits
aen     -0.99
fta     -0.98
thut    -0.98
erec    -0.95
Juf     -0.94
yong    -0.93
huma    -0.93
lein    -0.92
miu     -0.91
wien    -0.90
Positive Logits
occurs          0.62
done            0.59
ⓧ               0.59
arrol           0.58
occurred        0.55
conducted       0.55
HasColumnName   0.55
provided        0.54
undertaken      0.54
performed       0.54
Activations Density 0.307%