INDEX
Explanations
proper nouns related to legal or law enforcement contexts
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1974
+0.16
0.9%
966
+0.16
0.9%
1618
+0.14
0.8%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1363
+0.16
0.05
1343
+0.16
0.07
1575
+0.14
0.04
Negative Logits
intersper
-0.87
<bos>
-0.85
unspeak
-0.81
encomp
-0.78
impelled
-0.70
enshr
-0.69
interposed
-0.68
amass
-0.65
disreg
-0.65
indescri
-0.65
POSITIVE LOGITS
Bobby
0.79
Bobby
0.76
NKC
0.73
bobby
0.70
répon
0.68
Toe
0.66
Heiden
0.63
coration
0.63
padx
0.62
úrese
0.62
Activations Density 0.780%