Explanations
the term "gall," potentially indicating a focus on discussions regarding resilience or boldness
Neuron Alignment
Index   Value   % of L₁
300     +0.14   0.8%
429     +0.14   0.8%
349     +0.13   0.7%
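
The "% of L₁" column is not defined on this page. One plausible reading, stated here as an assumption, is that it gives the absolute value of a single component as a share of the L₁ norm (the sum of absolute values) of the whole vector. A minimal sketch of that calculation, using a hypothetical toy vector rather than the real feature weights:

```python
def l1_share(vector, index):
    """Return |vector[index]| as a fraction of the vector's L1 norm.

    Assumption: this mirrors what a "% of L1" column might report;
    the dashboard itself does not confirm the formula.
    """
    l1_norm = sum(abs(x) for x in vector)
    if l1_norm == 0:
        raise ValueError("L1 norm is zero; the share is undefined")
    return abs(vector[index]) / l1_norm


# Hypothetical toy vector, chosen only for illustration.
toy_vector = [0.14, -0.06, 0.30, -0.50]
share = l1_share(toy_vector, 0)
print(f"component 0 holds {share:.1%} of the L1 norm")
```

Under the same assumed reading, a value of +0.14 shown as 0.8% would imply an L₁ norm of roughly 0.14 / 0.008 = 17.5, subject to rounding in both displayed figures.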
Correlated Neurons
Index   P. Corr.   Cos Sim.
1       +0.14      0.01
260     +0.14      0.01
349     +0.13      0.01
Negative Logits
)){         -1.84
slightest   -1.62
cens        -1.61
otherwise   -1.50
certain     -1.49
ores        -1.46
cens        -1.43
arer        -1.42
ges         -1.42
priori      -1.41
Positive Logits
stones      2.12
©           2.07
ery         1.97
¢           1.77
phrase      1.72
ERY         1.70
stone       1.64
ableView    1.62
aho         1.61
¾           1.60
Activations Density 0.020%