INDEX
Explanations
words related to motivational speeches and personal growth
Neuron Alignment
Index   Value   % of L₁
1577    +0.20   0.7%
1343    +0.19   0.7%
764     +0.17   0.6%
Correlated Neurons
Index   P. Corr.   Cos Sim.
764     +0.20      0.03
1577    +0.19      0.04
184     +0.17      0.00
Negative Logits
+#+#              -0.62
Precautionary     -0.57
ActionCreators    -0.57
Dichloropropane   -0.56
PageRoute         -0.55
Abitanti          -0.52
País              -0.52
Demografía        -0.51
Chham             -0.51
<bos>             -0.50
Positive Logits
unspeak      1.25
impra        1.24
disagre      1.22
reluct       1.17
YMMV         1.17
disreg       1.17
apprehen     1.17
Mlle         1.16
intersper    1.16
shenan       1.10
Activations Density: 0.230%