INDEX
Explanations
statements related to positive feelings and actions associated with being vegan
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
453
+0.17
0.5%
1577
+0.14
0.4%
184
+0.13
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
453
+0.17
0.03
1471
+0.14
0.03
127
+0.13
0.02
Negative Logits
reluct
-2.47
accla
-2.46
increa
-2.44
squa
-2.43
secon
-2.42
strick
-2.41
fta
-2.40
volunte
-2.39
inev
-2.37
depic
-2.36
POSITIVE LOGITS
IsContent
1.02
CreateTagHelper
0.91
TagMode
0.88
ConstraintMaker
0.80
EndContext
0.79
enumi
0.78
WriteTagHelper
0.75
makeConstraints
0.75
RectangleBorder
0.73
ContentAlignment
0.72
Activations Density 0.045%