INDEX
Explanations
mentions of the word "gold" or related terms
Neuron Alignment
Index    Value    % of L₁
596      +0.16    0.6%
390      +0.14    0.6%
1622     +0.13    0.6%
Correlated Neurons
Index    P. Corr.    Cos Sim.
1622     +0.16       0.03
596      +0.14       0.03
390      +0.13       0.03
Negative Logits
Baillargeon        -0.53
mlung              -0.52
verwijspagina      -0.50
InjectAttribute    -0.49
Nationen           -0.49
queryInterface     -0.47
PostExecute        -0.47
fjspx              -0.47
quelize            -0.47
PrototypeOf        -0.47
Positive Logits
GOLD         1.26
gold         1.26
Gold         1.22
gold         1.21
Gold         1.18
GOLD         1.07
Goldie       0.88
Goldsmith    0.86
doré         0.85
Goldstein    0.85
Activations Density: 0.068%