INDEX
Explanations
phrases related to political agreements, negotiations, and conventions
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1757
+0.21
0.8%
251
+0.18
0.7%
1145
+0.16
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
251
+0.21
0.03
1757
+0.18
0.03
1145
+0.16
0.02
Negative Logits
mistak
-0.56
horned
-0.51
asserole
-0.49
getIndex
-0.47
uckoo
-0.47
relenting
-0.47
frescoes
-0.47
Eliot
-0.47
horny
-0.46
trit
-0.46
POSITIVE LOGITS
package
1.45
packages
1.36
Package
1.36
Packages
1.27
Package
1.24
package
1.18
PACKAGE
1.11
pack
1.08
packages
1.07
PACKAGE
1.07
Activations Density 0.063%