Explanations
descriptions related to physical health issues and personal struggles
Neuron Alignment
Index    Value    % of L₁
509      +0.11    0.3%
630      +0.08    0.2%
1553     +0.08    0.2%
Correlated Neurons
Index    P. Corr.    Cos Sim.
509      +0.11       0.08
1317     +0.08       0.07
1553     +0.08       0.07
Negative Logits
unsplash    -1.14
susun       -1.05
oleo        -0.99
Quod        -0.94
olx         -0.93
silikon     -0.93
unden       -0.93
roberto     -0.93
milano      -0.92
pixabay     -0.92
Positive Logits
feeling       0.76
despair       0.72
unable        0.63
feelings      0.62
feel          0.61
feels         0.61
anxiety       0.61
frustrated    0.61
struggle      0.59
hopeless      0.59
Activations Density: 0.915%