INDEX
Explanations
information related to publication details, copyright, and contact information for various materials
Neuron Alignment
Index   Value   % of L₁
1506    +0.14   0.9%
805     +0.12   0.7%
50      +0.12   0.7%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1978    +0.14      0.06
1837    +0.12      0.05
1527    +0.12      0.04
Negative Logits
<bos>       -2.72
ⓧ           -1.18
/**         -0.99
<?          -0.88
(blank)     -0.86
/***        -0.71
disbur      -0.69
circulate   -0.61
endow       -0.59
springfox   -0.59
Positive Logits
ananas      0.96
marte       0.96
bayern      0.95
optik       0.93
mikrofon    0.92
maroc       0.91
vasi        0.90
karton      0.89
lele        0.88
makro       0.87
Activations Density: 0.136%