INDEX
Explanations
criteria or qualifications for membership in certain organizations or groups
Neuron Alignment
Index    Value    % of L₁
766      +0.18    0.7%
1967     +0.18    0.7%
1842     +0.16    0.6%
Correlated Neurons
Index    P. Corr.    Cos Sim.
766      +0.18       0.05
845      +0.18       0.04
919      +0.16       0.01
Negative Logits
himo        -0.65
praktik     -0.64
kriminal    -0.63
republi     -0.58
ekst        -0.58
akut        -0.58
biograf     -0.57
kritis      -0.57
kompati     -0.56
antik       -0.55
Positive Logits
AssemblyCulture    0.57
jectures           0.51
saurait            0.49
malheur            0.48
człowie            0.47
affez              0.45
eccell             0.45
fatica             0.44
OSError            0.44
mężczy             0.42
Activations Density: 0.555%