Explanations
mentions of specific crimes, particularly homicides
Neuron Alignment
Index    Value    % of L₁
1253     +0.09    0.3%
1499     +0.09    0.3%
198      +0.09    0.3%
Correlated Neurons
Index    P. Corr.    Cos Sim.
1499     +0.09       0.06
1249     +0.09       0.03
1394     +0.09       0.05
Negative Logits
triomphe     -0.88
shenan       -0.88
!...         -0.85
snoopy       -0.84
maît         -0.84
poff         -0.82
aquarelle    -0.82
Ename        -0.80
vhs          -0.79
wattpad      -0.79
Positive Logits
deaths        0.79
homicide      0.72
fatalities    0.69
murders       0.64
incid         0.62
incidence     0.59
statistics    0.59
homic         0.58
fatal         0.58
killings      0.58
Activations Density 0.493%