Explanations
verbs related to actions or events
Neuron Alignment
Index   Value   % of L₁
1602    +0.13   0.4%
1325    +0.12   0.4%
1983    +0.12   0.4%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1325    +0.13      0.05
569     +0.12      0.08
690     +0.12      0.04
Negative Logits
Token       Logit
silikon     -0.61
conferir    -0.60
ekster      -0.59
clicar      -0.59
kriminal    -0.58
algu        -0.58
poliuret    -0.58
reconnaît   -0.58
ideolog     -0.57
Kä          -0.57
Positive Logits
Token       Logit
hairc       1.25
inext       1.21
madonna     1.13
pegasus     1.12
swarovski   1.12
outlander   1.10
jurassic    1.10
ecru        1.10
snoopy      1.08
impra       1.08
Activations Density 0.751%