INDEX
Explanations
instances of the word "packed."
Neuron Alignment
Index   Value   % of L₁
376     +0.19   1.1%
148     +0.12   0.7%
305     +0.12   0.7%
Correlated Neurons
Index   P. Corr.   Cos Sim.
24      +0.19      0.02
255     +0.12      0.01
305     +0.12      0.01
Negative Logits
Month        -1.60
ORS          -1.60
celebrated   -1.58
interested   -1.55
chers        -1.54
likely       -1.54
chance       -1.52
curious      -1.50
friends      -1.49
iginally     -1.49
Positive Logits
olini           1.84
offence         1.73
icture          1.64
ĻĤ              1.62
othal           1.55
oddsidemargin   1.54
stab            1.53
ulses           1.49
ounter          1.48
ussion          1.47
Activations Density 0.183%