Explanations
mentions of a specific individual named Healy
Neuron Alignment
Index    Value    % of L₁
1548     +0.08    0.2%
1065     +0.08    0.2%
257      +0.07    0.2%
Correlated Neurons
Index    P. Corr.    Cos Sim.
283      +0.08       0.04
1343     +0.08       0.07
1120     +0.07       0.07
Negative Logits
cabul         -0.54
intendent     -0.52
er            -0.50
im            -0.49
vom           -0.49
mability      -0.49
journ         -0.49
-             -0.48
ashier        -0.48
פון           -0.48
Positive Logits
aly           1.29
volunte       1.21
guarante      1.11
thut          1.10
disagre       1.08
inev          1.07
quitted       1.06
coö           1.06
kte           1.06
accla         1.05
Activations Density 0.499%