INDEX
Explanations
information related to personal stories, sports achievements, and recovery journeys
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1013
+0.12
0.4%
194
+0.10
0.3%
509
+0.09
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
194
+0.12
0.03
509
+0.10
0.07
1317
+0.09
0.06
Negative Logits
mef
-1.03
daf
-1.01
glau
-1.00
dises
-0.95
wien
-0.94
auri
-0.94
hcm
-0.93
uncin
-0.92
lein
-0.92
stefan
-0.92
POSITIVE LOGITS
<bos>
0.83
helped
0.60
setGeometry
0.59
my
0.57
experiences
0.53
overcame
0.53
NewUrlParser
0.53
learnings
0.53
lessons
0.53
became
0.52
Activations Density 0.689%