INDEX
Explanations
phrases related to negotiations and interactions with others
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
687
+0.16
0.6%
752
+0.15
0.5%
1150
+0.11
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
687
+0.16
0.06
752
+0.15
0.04
1984
+0.11
0.05
Negative Logits
geze
-0.56
digress
-0.44
ütt
-0.44
jajaja
-0.43
dolu
-0.42
caufe
-0.42
ineffec
-0.42
rlrl
-0.42
limsy
-0.42
difp
-0.41
POSITIVE LOGITS
AVEC
0.72
Jérusalem
0.71
Meille
0.65
appuy
0.63
Avec
0.61
WITH
0.60
noyau
0.59
trône
0.59
vigueur
0.57
congrès
0.56
Activations Density 0.226%