INDEX
Explanations
patterns related to user interaction with newsletters or subscriptions
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
674
+0.17
0.5%
1343
+0.14
0.4%
453
+0.12
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1702
+0.17
0.02
1237
+0.14
0.02
595
+0.12
0.02
Negative Logits
getCity
-0.63
McLaugh
-0.62
barbarous
-0.56
userModel
-0.56
asmuch
-0.55
itemList
-0.55
Punj
-0.54
unspeak
-0.54
liberality
-0.54
olsom
-0.53
POSITIVE LOGITS
anse
0.83
obb
0.82
cyr
0.78
scol
0.78
rege
0.75
obiet
0.74
attes
0.73
affez
0.72
reger
0.72
obligator
0.72
Activations Density 0.047%