Explanations
instances where someone is accused or suspected of committing various criminal acts
Neuron Alignment
Index    Value    % of L₁
1520     +0.13    0.4%
1837     +0.13    0.4%
878      +0.12    0.4%
Correlated Neurons
Index    P. Corr.    Cos Sim.
1837     +0.13       0.04
878      +0.13       0.04
1520     +0.12       0.04
Negative Logits
krab       -0.65
kram       -0.64
Ukraina    -0.60
utop       -0.58
besta      -0.58
eko        -0.57
plak       -0.57
Déf        -0.56
conclud    -0.55
optik      -0.55
Positive Logits
charge      1.38
charged     1.33
charges     1.31
charge      1.31
Charge      1.30
Charges     1.24
charging    1.24
Charge      1.23
charged     1.19
charges     1.18
Activations Density 0.087%