Explanations
positive interactions between law enforcement officers and members of the community
Neuron Alignment
Index   Value   % of L₁
509     +0.10   0.3%
74      +0.09   0.2%
964     +0.08   0.2%
Correlated Neurons
Index   P. Corr.   Cos Sim.
509     +0.10      0.06
74      +0.09      0.03
392     +0.08      0.02
Negative Logits
impra         -0.84
overla        -0.83
mef           -0.78
unve          -0.78
impractica    -0.76
compen        -0.76
wherea        -0.75
arbitrar      -0.74
dsg           -0.73
Simult        -0.73
Positive Logits
youth             0.57
Brainz            0.53
drugs             0.53
drug              0.51
teenage           0.49
spender           0.49
rehabilitation    0.49
ineno             0.49
initState         0.49
onStop            0.49
Activations Density: 0.532%