INDEX
Explanations
words related to crime and law, especially regarding theft and coercion
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1741
+0.21
0.7%
1967
+0.20
0.7%
1385
+0.18
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1967
+0.21
0.07
16
+0.20
0.08
50
+0.18
0.06
Negative Logits
WithIOException
-0.61
malheureux
-0.61
couteau
-0.61
oiseau
-0.59
vainqueur
-0.58
oeil
-0.58
monstre
-0.58
vété
-0.56
minY
-0.55
doigt
-0.54
POSITIVE LOGITS
Fitment
0.63
MOQ
0.61
entire
0.60
underlying
0.57
same
0.56
destina
0.56
/**
0.55
respective
0.55
palet
0.54
/*
0.53
Activations Density 0.518%