INDEX
Explanations
phrases related to workplace organization and time management
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1252
+0.09
0.2%
963
+0.08
0.2%
1129
+0.08
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
963
+0.09
0.05
1290
+0.08
0.06
1428
+0.08
0.02
Negative Logits
wherea
-1.20
increa
-1.16
fortn
-1.16
volunte
-1.16
reluct
-1.15
squa
-1.13
accla
-1.12
disagre
-1.12
depic
-1.12
secon
-1.10
POSITIVE LOGITS
convincing
0.84
persuade
0.84
convince
0.83
persuasive
0.76
persuasion
0.71
vincing
0.68
appeal
0.64
thuyết
0.64
transQ
0.63
persuading
0.62
Activations Density 0.644%