INDEX
Explanations
references to ancient religious practices, rituals, and beliefs
Neuron Alignment
Index   Value   % of L₁
184     +0.21   0.7%
872     +0.19   0.6%
764     +0.19   0.6%
Correlated Neurons
Index   P. Corr.   Cos Sim.
872     +0.21      0.03
764     +0.19      0.02
1959    +0.19      0.03
Negative Logits
akut        -0.79
notor       -0.78
demokra     -0.76
panik       -0.75
alkoh       -0.73
mikrofon    -0.73
kriminal    -0.69
praktik     -0.69
foton       -0.68
ekos        -0.68
Positive Logits
disagre     1.30
unspeak     1.27
shewn       1.24
withal      1.22
encomp      1.22
unwarran    1.21
indescri    1.20
tolerably   1.20
apprehen    1.20
gaily       1.18
Activations Density: 0.105%