INDEX
Explanations
entities related to investigations and examinations, such as investigators, researchers, and experts
Neuron Alignment
Index   Value   % of L₁
513     +0.11   0.4%
1865    +0.10   0.3%
889     +0.09   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1865    +0.11      0.04
484     +0.10      0.04
513     +0.09      0.04
Negative Logits
imprend       -0.56
pères         -0.55
Pagina        -0.53
ambassade     -0.48
anivers       -0.48
Volumen       -0.47
zette         -0.47
electrica     -0.46
prêtres       -0.46
serviceName   -0.46
Positive Logits
alha          0.65
naer          0.58
apprehen      0.57
saad          0.57
withal        0.56
którzy        0.56
pecified      0.56
kelle         0.55
investors     0.55
negotiators   0.55
Activations Density: 0.255%