INDEX
Explanations
words related to the exploitation of animals for various purposes
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
198
+0.13
0.4%
1385
+0.12
0.4%
1842
+0.10
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
923
+0.13
0.03
1857
+0.12
0.02
1293
+0.10
0.04
Negative Logits
pamph
-0.96
quitted
-0.86
apprehen
-0.81
reconno
-0.81
Moslem
-0.72
gaily
-0.72
Minang
-0.71
marcato
-0.71
Shakspeare
-0.71
shenan
-0.70
POSITIVE LOGITS
szóci
0.52
annica
0.49
\%$\\
0.46
parfum
0.45
used
0.44
askets
0.44
dishes
0.43
salads
0.43
FUNERAL
0.43
used
0.42
Activations Density 0.315%