INDEX
Explanations
phrases related to crime and law enforcement
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1150
+0.16
0.5%
1445
+0.13
0.4%
1177
+0.13
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
16
+0.16
0.07
752
+0.13
0.05
1445
+0.13
0.07
Negative Logits
gmbh
-0.81
etui
-0.81
capulco
-0.78
benzin
-0.77
klat
-0.76
fote
-0.76
torba
-0.76
boks
-0.75
hek
-0.72
quarelle
-0.72
POSITIVE LOGITS
<bos>
1.01
Viene
0.90
Şi
0.84
Mentre
0.81
Quelques
0.78
quelqu
0.77
Queste
0.76
Atsauces
0.75
Conheça
0.74
Había
0.73
Activations Density 0.551%