INDEX
Explanations
topics related to classic literature and movies
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
674
+0.22
0.8%
964
+0.19
0.7%
1967
+0.11
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
62
+0.22
0.05
198
+0.19
0.05
599
+0.11
0.07
Negative Logits
unspeak
-0.80
Illus
-0.77
Byp
-0.75
Varies
-0.74
Moderately
-0.72
Considerable
-0.71
Incidentally
-0.69
Applicability
-0.68
shenan
-0.68
indescri
-0.67
POSITIVE LOGITS
ⓧ
0.89
<bos>
0.81
<?
0.72
endwhile
0.70
Meksiku
0.70
expandindo
0.69
/**
0.67
0.65
Obrázky
0.61
tristesse
0.61
Activations Density 1.187%