Explanations
phrases related to resistance or objection
Neuron Alignment
Index   Value   % of L₁
1055    +0.10   0.3%
1842    +0.10   0.3%
1150    +0.09   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1055    +0.10      0.05
508     +0.10      0.06
178     +0.09      0.04
Negative Logits
aen      -1.04
Confu    -1.03
fta      -1.03
lyon     -1.02
levis    -1.00
loren    -1.00
Juf      -0.99
thut     -0.96
fluo     -0.96
stefan   -0.95
Positive Logits
AnchorTagHelper    0.62
توض                0.59
bindingNavigator   0.53
ligiloj            0.52
FetchType          0.52
anelas             0.51
]")]               0.50
pageContext        0.48
.}(                0.48
also               0.48
Activations Density: 0.415%