INDEX
Explanations
verbs related to creating or causing something
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1793
+0.11
0.4%
1472
+0.10
0.3%
486
+0.10
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1793
+0.11
0.09
1472
+0.10
0.06
791
+0.10
0.07
Negative Logits
Dijo
-0.54
carregar
-0.53
Dijo
-0.53
inserir
-0.52
setToolTipText
-0.50
rafra
-0.50
BigNumber
-0.49
antwoorden
-0.48
FlatAppearance
-0.47
Pernambuco
-0.46
POSITIVE LOGITS
MADE
1.05
Made
1.00
make
0.99
MAKE
0.98
MADE
0.97
Made
0.96
made
0.95
made
0.95
Make
0.94
MAKE
0.94
Activations Density 0.196%