INDEX
Explanations
references to legal or criminal activities and investigations
Neuron Alignment
Index   Value   % of L₁
946     +0.13   0.4%
1013    +0.10   0.3%
1499    +0.09   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
946     +0.13      0.06
939     +0.10      0.04
1499    +0.09      0.04
Negative Logits
cushi     -0.67
horrend   -0.50
vuol      -0.49
prouve    -0.49
échou     -0.49
apparti   -0.47
notare    -0.47
ferait    -0.47
ruine     -0.47
migli     -0.47
Positive Logits
romptu          0.56
raided          0.54
raid            0.54
atience         0.53
lacable         0.49
etermined       0.49
läm             0.47
Personensuche   0.47
YNAMIC          0.46
raids           0.46
Activations Density 0.339%