Explanations
phrases related to personal growth and self-improvement
Neuron Alignment
Index   Value   % of L₁
50      +0.15   0.6%
1705    +0.09   0.4%
2016    +0.09   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1013    +0.15      0.11
2016    +0.09      0.06
1314    +0.09      0.06
Negative Logits
<bos>       -2.86
/***        -0.61
fulfill     -0.60
//---       -0.59
&           -0.58
rely        -0.58
//...       -0.58
establish   -0.58
//@         -0.57
employ      -0.57
Positive Logits
bandung     1.44
soggior     1.42
bayern      1.34
bordeaux    1.30
napoli      1.30
swarovski   1.29
palio       1.28
maroc       1.28
nutella     1.25
paradiso    1.25
Activations Density: 1.572%