INDEX
Explanations
movie titles in a list or catalog format
Neuron Alignment
Index   Value   % of L₁
1343    +0.18   0.6%
2019    +0.17   0.5%
453     +0.14   0.5%
Correlated Neurons
Index   P. Corr.   Cos Sim.
764     +0.18      0.04
453     +0.17      0.04
1343    +0.14      0.04
Negative Logits
apprehen      -0.81
gaily         -0.74
impractica    -0.73
shenan        -0.72
unlaw         -0.71
murmuring     -0.71
rascal        -0.70
impelled      -0.69
reluct        -0.66
reconno       -0.65
Positive Logits
Formazione         0.74
Caratter           0.72
Oltre              0.64
Voci               0.61
Kategor            0.61
Caratteristiche    0.58
ejus               0.57
Più                0.57
Inoltre            0.56
nomine             0.56
Activations Density 0.152%