INDEX
Explanations
words related to legal proceedings and situations, specifically focusing on the word "arrests."
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
325
+0.15
0.6%
1137
+0.13
0.5%
1872
+0.13
0.5%
Correlated Neurons
Index
P. Corr.
Cos Sim.
325
+0.15
0.05
1872
+0.13
0.04
1137
+0.13
0.04
Negative Logits
<bos>
-0.55
XMLSchema
-0.53
abetes
-0.52
эффици
-0.50
kaynağından
-0.50
contentLoaded
-0.50
ชุด
-0.50
remadura
-0.49
щему
-0.49
בכ
-0.49
POSITIVE LOGITS
dises
1.27
mef
1.26
!...
1.25
ardu
1.23
?...
1.22
milano
1.22
marseille
1.21
embra
1.21
thut
1.18
fta
1.17
Activations Density 0.172%