INDEX
Explanations
phrases related to production, ordering, and customization of items
Neuron Alignment
Index   Value   % of L₁
1177    +0.11   0.3%
1978    +0.10   0.3%
946     +0.08   0.2%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1586    +0.11      0.03
691     +0.10      0.01
1745    +0.08      0.03
Negative Logits
meras       -0.70
kosme       -0.69
LookAnd     -0.69
minimalis   -0.69
melat       -0.66
lomb        -0.66
radikal     -0.66
PhysRevD    -0.66
Meksiko     -0.66
kalori      -0.66
Positive Logits
shenan      0.87
hentai      0.82
depic       0.82
sophistic   0.80
pamph       0.78
yoda        0.76
indestru    0.75
impra       0.75
McInt       0.74
unspeak     0.73
Activations Density 0.403%