INDEX
Explanations
phrases related to career changes and personal decisions
Neuron Alignment
Index   Value   % of L₁
50      +0.34   1.3%
1013    +0.16   0.6%
2015    +0.10   0.4%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1862    +0.34      0.07
1013    +0.16      0.09
1217    +0.10      0.05
Negative Logits
<bos>          -1.43
harmonize      -0.56
enshr          -0.56
neutralize     -0.54
qiao           -0.53
cooperated     -0.53
standardize    -0.53
addCriterion   -0.52
unblock        -0.51
localize       -0.51
Positive Logits
churrasco      0.96
swarovski      0.92
churras        0.89
paillettes     0.84
ecru           0.84
Bárbara        0.83
grossa         0.81
Bekasi         0.81
pousada        0.81
boulangerie    0.80
Activations Density: 1.174%