Explanations
instances of the word "plug" or phrases related to devices being connected
Neuron Alignment
Index   Value   % of L₁
204     +0.15   0.6%
214     +0.14   0.5%
1194    +0.12   0.5%
Correlated Neurons
Index   P. Corr.   Cos Sim.
204     +0.15      0.02
214     +0.14      0.02
1194    +0.12      0.02
Negative Logits
alkoh      -0.81
kriminal   -0.79
panik      -0.77
kaos       -0.75
seksi      -0.72
kosme      -0.72
konserv    -0.70
kooper     -0.70
karton     -0.69
radikal    -0.68
Positive Logits
plug       1.63
plugs      1.47
plug       1.44
Plug       1.41
plugged    1.32
plugin     1.30
Plug       1.30
plugging   1.29
plugs      1.25
PLUG       1.22
Activations Density 0.079%